Statistical Data Analyst & Machine Learning Engineer

Transforming complex datasets into measurable business insights through statistical modeling, predictive analytics, and structured experimentation.

About Me

I am a Data Engineering and Machine Learning enthusiast, currently a graduate with a Master's degree in Statistical Data Analytics and Computing Sciences, with a strong interest in transforming complex data into meaningful insights. My expertise includes statistical analysis, feature engineering, classification modelling, and evaluation using structured experimentation and validation techniques.

I enjoy exploring new technologies, improving my technical skills, and developing data-driven solutions. Beyond my professional interests, I enjoy spending time in nature, exploring new places, and experiencing local wildlife and cultures, especially the local cuisine, which explains my personal motivation towards moving to Finland. Being in Finland has allowed me to grow not only as a data professional but also as someone who appreciates how technology and quality of life can coexist.

Selected Projects

ML Churn Dashboard

End-to-end churn prediction system with feature engineering, classification modeling, and dashboard-driven business insights.

Logistic Regression · Random Forest · ROC-AUC View Project →

Credit Card Fraud Detection

Fraud detection model optimized for high recall in highly imbalanced datasets. Applied precision-recall tradeoff analysis to reduce financial risk.

Imbalanced Learning · F1 Score · Precision/Recall View Project →

ML Analysis – Wine Dataset

Comparative classification study with feature selection, cross-validation, and performance benchmarking.

Model Comparison · Cross Validation View Project →

Object Detection

Implemented computer vision detection pipeline using Python, exploring feature extraction and detection accuracy.

Computer Vision · Image Processing View Project →

PCA & Model Selection

Applied dimensionality reduction techniques to improve model efficiency and compared multiple classifiers for optimal performance.

PCA · Model Optimization · Pipeline Design View Project →

Core Skills

Python
Statistical Analysis
Machine Learning
Scikit-Learn
Pandas & NumPy
Feature Engineering
Model Evaluation
Data Visualization
Git & GitHub

Contact

Email: surajsharma1921@gmail.com

GitHub: github.com/Suraj192

LinkedIn: View Profile