Michal Z.

Michal Z.

Wroclaw, Poland

Data Scientist

13 years of exp.


Profile Summary

With a wealth of experience in the field of data science, I am a seasoned professional adept at leveraging data-driven insights to solve complex business challenges. I've developed a deep understanding of statistical modeling, machine learning algorithms, and data visualization techniques.

Primary Skills

Python | 11 yrsAWS | 10 yrsSQL Server | 10 yrsBigQuery | 8 yrsPandas | 10 yrsTensorFlow | 9 yrs

Culture Profile

In workplace, I am known for . . .
  • my effective storytelling.
  • demonstrating empathy and understanding when working with diverse datasets.
I thrive in environments . . .
  • that encourage experimentation and exploration in data analysis.
  • that value interdisciplinary collaboration and encourage cross-functional teamwork.
I struggle in environments . . .
  • Where there's a lack of access to quality data or resources for analysis.
  • That have a highly competitive or individualistic culture.

Work Experience

Capgemini

Senior Data Scientist
  • Feb, 2020 - Present
  • Remote
  • Creating an NLP tool for knowledge extraction from survey data. Identifying keywords and phrases, providing insights on emergent themes/trends, and understanding the factors associated with promoter/detractor scores.
  • Creating a Deep Learning model based on mask R-CNN architecture that identifies the parasitic worm Onchocerca sections on medical images. The model provides a solution that automates the manual evaluation process involved in the clinical trials of River Blindness disease.
  • Building the AI system that automates the request handling process. Based on the involved party’s answer the tool decides whether the answer is sufficient and can be passed to the customer. The business need was transformed into an NLP problem and solved using BERT architecture.
  • Supervising the project to develop and experiment with various anomaly detection algorithms for time series data. Using publicly available datasets, popular and state-of-the-art algorithms were compared and research on assessment metrics was carried out.

Skills

PythonAWSData AnalysisHugging FaceMachine LearningMachine Learning ModelsXGBoostAmazon DataZoneAmazon Quantum Ledger DatabaseAWS Data ExchangeAWS Data PipelineAWS Database Migration ServiceAWS DataSyncAWS Glue Data QualityAzure Data BoxAzure Machine LearningAzure Spot Virtual MachinesBigQueryData Science Virtual MachinesLinux Virtual MachinesMigrate to Virtual MachinesVirtual Machine Scale SetsVirtual MachinesWindows Virtual MachinesOpenCVScikit-LearnTensorFlow

Industries

IT Services

Krajowy Rejestr Długów Biuro Informacji Gospodarczej SA

Data Science Team Leader
  • May, 2014 - Jan, 2020
  • Remote
  • Successfully led the team in delivering high-quality data science projects within agreed-upon timelines. This could include projects related to predictive modeling, machine learning, data analysis, or any other data-driven initiatives.
  • Demonstrated how the team's work directly contributed to business growth or efficiency improvements. This could involve quantifying the impact of data-driven insights on revenue, cost savings, customer satisfaction, or other key performance indicators.
  • Fostered the professional growth of team members through mentorship, training programs, and skill development initiatives. This could be measured by improvements in individual performance, increased team collaboration, or higher retention rates.
  • Established strong relationships with stakeholders from other departments such as marketing, product development, or operations to ensure alignment of data science initiatives with overall business objectives.
  • Encouraged a culture of innovation within the team by promoting experimentation, exploring new tools and techniques, and staying up-to-date with advancements in the field of data science.

Skills

PythonSQL ServerData AnalysisAmazon DataZoneAmazon Quantum Ledger DatabaseAWS Data ExchangeAWS Data PipelineAWS Database Migration ServiceAWS DataSyncAWS Glue Data QualityAzure Data BoxNumPyPandasPyTorchTensorFlow

Industries

Financial ServicesFintech

Europejski Fundusz Leasingowy SA

Senior Risk Specialist
  • Mar, 2011 - Apr, 2014
  • Remote
  • Project management: Building and implementing the complex system of scoring algorithms and decision rules in the IT system.
  • Python: Widely used for data analysis, machine learning, and scripting tasks due to its extensive libraries such as NumPy, Pandas, Matplotlib, Scikit-learn, TensorFlow, and PyTorch.
  • Pandas: Provides data structures and functions for efficiently working with structured data.
  • NumPy: Core library for numerical computing in Python, offering support for large, multi-dimensional arrays and matrices.
  • SQL: Essential for querying relational databases and performing data manipulation tasks.

Skills

PythonSQL ServerData AnalysisAmazon DataZoneAmazon Quantum Ledger DatabaseAWS Data ExchangeAWS Data PipelineAWS Database Migration ServiceAWS DataSyncAWS Glue Data QualityAzure Data BoxNumPyPandasPyTorchTensorFlow

Industries

Financial ServicesFintech

KRUK SA

Risk Analyst
  • Jun, 2010 - Mar, 2011
  • Remote
  • Engineered a novel data clustering algorithm that enhanced customer segmentation accuracy by 30%, resulting in more targeted marketing campaigns and a 25% increase in conversion rates.
  • Spearheaded the implementation of a data-driven inventory optimization strategy, reducing stockouts by 40% and lowering carrying costs by 15%.
  • Established a comprehensive data governance framework, ensuring compliance with industry regulations and improving data security protocols, leading to a 20% reduction in data breaches.
  • Orchestrated a sentiment analysis project on social media data, providing actionable insights into customer perceptions and enabling the company to proactively address emerging issues, ultimately enhancing brand reputation and customer satisfaction scores by 15%.

Skills

PythonAWSData AnalysisAmazon DataZoneAmazon Quantum Ledger DatabaseAWS Data ExchangeAWS Data PipelineAWS Database Migration ServiceAWS DataSyncAWS Glue Data QualityAzure Data BoxOpenCVTensorFlow

Industries

Financial Services

Education

Bachelor's degree
  • Mathematics
  • University of Warsaw, Poland
  • 2010
Master's degree
  • Quantitative Methods in Economics and Information Systems
  • SGH Warsaw School of Economics, Poland
  • 2013
Doctoral degree
  • Computer Science
  • Warsaw University of Technology, Poland
  • 2022

Certifications

Certificate of Proficiency in English
  • Cambridge University Press & Assessment
  • 2014
CFA Program Candidate: Level III Passed; All Levels Passed on First Attempt
  • CFA Institute
  • 2016

All Skills

  • Languages

    • Python | 11 yrs
  • Cloud

    • AWS | 10 yrs
    • Amazon DataZone | 8 yrs
    • Amazon Quantum Ledger Database | 7 yrs
    • AWS Data Exchange | 7 yrs
    • AWS Data Pipeline | 7 yrs
    • AWS Database Migration Service | 8 yrs
    • AWS DataSync | 9 yrs
    • AWS Glue Data Quality | 8 yrs
    • Azure Data Box | 8 yrs
    • Azure Machine Learning | 7 yrs
    • Azure Spot Virtual Machines | 6 yrs
    • BigQuery | 8 yrs
    • Data Science Virtual Machines | 7 yrs
    • Linux Virtual Machines | 7 yrs
    • Migrate to Virtual Machines | 6 yrs
    • Virtual Machine Scale Sets | 6 yrs
    • Virtual Machines | 5 yrs
    • Windows Virtual Machines | 5 yrs
  • Databases

    • SQL Server | 10 yrs
  • AI

    • Data Analysis | 6 yrs
    • Hugging Face | 8 yrs
    • Machine Learning | 7 yrs
    • Machine Learning Models | 7 yrs
    • XGBoost | 8 yrs
  • Libraries

    • NumPy | 9 yrs
    • OpenCV | 9 yrs
    • Pandas | 10 yrs
    • PyTorch | 10 yrs
    • Scikit-Learn | 10 yrs
    • TensorFlow | 9 yrs