Edina B.

Budapest, Hungary

Data Scientist

11 years of experience


Profile Summary

I am an experienced Senior Data Scientist with a track record of leading end-to-end projects. I collaborate closely with cross-functional teams to deliver actionable insights that drive strategic decision-making.

Primary Skills

  • Python | 9 yrs
  • SQL Server | 9 yrs
  • Linux Virtual Machines | 9 yrs
  • PostgreSQL | 7 yrs
  • Django | 7 yrs
  • Scala | 7 yrs
  • PyTorch | 6 yrs

Culture Profile

In the workplace, I am known for . . .
  • demonstrating strong critical thinking and problem-solving skills.
  • effectively managing and prioritizing multiple projects.
I thrive in environments . . .
  • that prioritize the ethical implications of data analysis and decision-making.
  • where there's a culture of mentorship and knowledge sharing.
I struggle in environments . . .
  • where there's a lack of collaboration or communication between teams.
  • where there's pressure to prioritize speed over accuracy in data analysis.

Work Experience

Jabil

Senior Data Scientist
  • Mar, 2022 - Present
  • Remote
  • Working on an in-circuit testing project that detects anomalies in test probe measurements in printed circuit board assembly production across multiple production sites.
  • Gathering data from various sources such as databases, APIs, files, or web scraping. Cleaning and preprocessing data to ensure its quality and readiness for analysis.
  • Exploring datasets using statistical techniques and visualization tools to understand patterns, trends, and relationships within the data.
  • Building and fine-tuning machine learning models for various tasks such as classification, regression, clustering, or recommendation systems.

Skills

Python, Data Analysis, Amazon DataZone, Amazon Quantum Ledger Database, AWS Data Exchange, AWS Data Pipeline, AWS Database Migration Service, AWS DataSync, AWS Glue Data Quality, Azure Data Box, DynamoDB, PostgreSQL, Django, PyTorch

Industries

Manufacturing

Nokia

Data Scientist
  • Jun, 2021 - Mar, 2022
  • Remote
  • Built data science applications on supply chain data, including models that scored mobile networking device components to evaluate their supply risk during the COVID-19 pandemic.
  • Gathered data from various sources, including databases, APIs, web scraping, and data lakes, assessing the nature of the data and its relevance to the problem at hand.
  • Processed raw data to identify and handle missing values, outliers, and inconsistencies, ensuring the quality and integrity of the data.
  • Conducted thorough exploratory analysis to understand the underlying patterns, relationships, and trends in the data.
  • Created new features or transformed existing ones to improve the performance of machine learning models.

Skills

Python, SQL Server, Data Analysis, Amazon DataZone, Amazon Quantum Ledger Database, AWS Data Exchange, AWS Data Pipeline, AWS Database Migration Service, AWS DataSync, AWS Glue Data Quality, Azure Data Box, Apache Spark

Industries

Telecommunications

DATAPAO

Data Scientist
  • Jan, 2013 - Sep, 2018
  • Remote
  • Transformed and generated features to predict process quality, performed model selection and validation, and communicated the results to stakeholders, including the CEO of the factory.
  • Performed exploratory data analysis of agricultural manufacturing data from industrial machines and sensors at the largest sunflower oil production facility in the region; I delivered this work as part of my master's thesis.
  • Combined big data, data science, and Six Sigma tools for one of the largest factories in the region to help DATAPAO establish predictive statistical process control at scale.
  • During the first phase of the project, designed and modeled the alerting rules and wrote the ETL in Python and Spark on Databricks.
  • Created official Databricks training materials for Data Engineers and Analysts covering engineering topics in the Manufacturing, Retail, Finance, and Healthcare domains, and simulated and generated unique, domain-specific data sets for them.

Skills

Python, SQL Server, Data Analysis, Amazon DataZone, Amazon Quantum Ledger Database, AWS Data Exchange, AWS Data Pipeline, AWS Database Migration Service, AWS DataSync, AWS Glue Data Quality, Azure Data Box, Azure Databricks, Linux Virtual Machines, Apache Spark, Jupyter Notebook/JupyterLab

Industries

Data and Analytics

Education

Bachelor's degree
  • Business Administration, Management and Operations
  • Dokuz Eylul University, Turkey
  • 2013
Master's degree
  • Business Analytics
  • Central European University, Austria
  • 2017

Certifications

AWS Certified Cloud Practitioner
  • Amazon Web Services (AWS)
  • 2022
Correlation and Regression
  • DataCamp
  • 2020
Supervised Learning in R
  • DataCamp
  • 2020
Experimental Design in R
  • DataCamp
  • 2020
Exploratory Data Analysis
  • DataCamp
  • 2019

All Skills

  • Languages

    • Java | 7 yrs
    • Python | 9 yrs
    • Scala | 7 yrs
  • Cloud

    • Azure | 7 yrs
    • Amazon DataZone | 6 yrs
    • Amazon Quantum Ledger Database | 6 yrs
    • AWS Data Exchange | 6 yrs
    • AWS Data Pipeline | 6 yrs
    • AWS Database Migration Service | 5 yrs
    • AWS DataSync | 5 yrs
    • AWS Glue Data Quality | 5 yrs
    • Azure Data Box | 5 yrs
    • Azure Databricks | 8 yrs
    • Linux Virtual Machines | 9 yrs
  • Databases

    • SQL Server | 9 yrs
    • DynamoDB | 7 yrs
    • MySQL | 6 yrs
    • PostgreSQL | 7 yrs
  • AI

    • Data Analysis | 6 yrs
  • Frameworks

    • Django | 7 yrs
  • Libraries

    • Apache Spark | 6 yrs
    • PyTorch | 6 yrs
  • Tools

    • Jupyter Notebook/JupyterLab | 6 yrs