top of page
Screen Shot 2021-12-22 at 6.55.36 PM.png

Dr Dilek Celik
Sr Data Scientist

SKILLS

Python | SQL | Keras | Tensorflow | SciKit-Learn | Pandas | NumPy | SciPy | Matplotlib | Seaborn | ggplot2 | Jupyter Notebook | R Markdown | Advanced Statistics (SPSS, A/B Testing) | Hadoop | Cloud (AWS, Azure) | Tableau

Data Science & Machine Learning Projects

Fraud Detection

Unknown-8.png
  • Build and maintain Logistic Regression, Random Forest, Deep Neural Network models

  • Performed Exploratory Data Analysis (EDA), Statistical Data Analysis, Data Cleaning, Handle Missing Data, Outliers Check, Feature Engineering, Data Pre-processing (Train-test split, Scaling, SMOTE), Built Models, Cross-Validation, Evaluation with confusion_matrix and classification_report, Compare Models, Predictions, Model Deployment 

  • Deployed Logistic Regression Model with 95% accuracy score using Flask on AWS cloud

Screen Shot 2021-12-23 at 12.44.12 PM.png

Regression Project - Car Price Prediction

Unknown-11.png
  • Build Linear Regression, Ridge Regression, Lasso Regression, Elastic-Net algorithms

  • Performed end to end Data Science steps, Exploratory Data Analysis, Quantitative Data Analysis, Data Cleaning, Feature Engineering, Multicollinearity Check, Detect Outliers, Pre-Processing, Implement Models, Cross-Validation, Feature Importance, Compare Models, Save Model, Predictions

  • Building model using pandas, numpy, matplotlib, seaborn, sklearn, scipy, cross_validate. 

  • Evaluated models using the following metrics mean_absolute_error, mean_squared_error, r2_score, root mean squared error

Screen Shot 2021-12-23 at 12.44.12 PM.png

Customer Segmentation Cluster Analysis with Un-supervised Learning

Unknown-13.png
  • Built K Means Clustering, Hierarchical Clustering (AgglomerativeClustering) models

  • Performed Exploratory Data Analysis (EDA), Quantitative Data Analysis, Data Cleaning, Detect Missing Values and Outliers, Outliers removal with IQR and Zscore, Data Pre-Processing, Cluster Analysis, Built Models 

  • Used tools are pandas, numpy, sklearn, scipy, matplotlib, seaborn, yellowbrikcs. 

  • Performed Cluster Analysis with Hopkins, Elbow Method, Silhouette Score, Dendogram.

Screen Shot 2021-12-23 at 12.44.12 PM.png
bottom of page