SHARMA Suraj

Aspiring Data Engineer | Machine Learning Enthusiast | Data Analyst

Profile

Data professional specializing in data-driven systems and machine learning applications, with a strong analytical and problem-solving mindset. Experienced in translating complex data into actionable insights and collaborating on technical solutions in team environments. Adaptable and quick to learn, with a proactive approach and a strong interest in applying skills to real-world challenges and contributing to impactful projects.

Education

Master’s Degree in Statistical Data Analytics and Computing Science – Tampere University

2022 – 2026

  • Thesis: ML-based control factor identification in wastewater treatment Plant
  • Relevant coursework: Machine Learning, Data Engineering, Statistics

Projects

Wastewater Treatment Plant Optimization (Thesis)

  • Developed ML models to identify key operational control variables
  • Applied clustering (SOM) and similarity metrics for pattern analysis
  • Characterization of clustered classes according to process domains
  • Improved system understanding and decision-making support

ML Customer Churn Dashboard

  • Developed a machine learning model to predict customer churn and identify retention risks
  • Built an interactive dashboard to visualize churn patterns and key influencing factors
  • Integrated model outputs with user-friendly visualizations for business decision support

Credit Card Fraud Detection

  • Built a classification model to detect fraudulent transactions in highly imbalanced datasets
  • Performed data preprocessing, feature engineering, and model evaluation to improve detection accuracy
  • Applied anomaly detection techniques to identify suspicious transaction patterns

Experience

Research Assistant (Master's Thesis Collaboration)

Ramboll

2025-2026

  • Conducting applied research on wastewater treatment optimization using machine learning
  • Collaborating with industry professionals to align research with real-world engineering needs
  • Analyzing operational data to identify key control variables affecting system performance
  • Developing and evaluating models to support data-driven decision-making

Data Science Trainee (Intensive Program)

Integrify, Helsinki

Oct 2020 – Mar 2021

  • Completed a 6-month intensive training program in data science and machine learning engineering
  • Developed hands-on projects involving data preprocessing, model building, and evaluation
  • Worked in team-based environments simulating real-world industry workflows
  • Collaborated in team-based projects simulating industry workflows

Contact

Email: surajsharma1921@gmail.com

GitHub: https://github.com/Suraj192

LinkedIn: https://www.linkedin.com/in/suraj-sharma-a54060127/

Technical Skills

Data Engineering: Python, SQL, ETL Pipelines, Real-time data Processing, Scala

Machine Learning: Scikit-learn, Feature Engineering, Model Evaluation

Data Analysis: EDA, Data Visualization, Power BI

Core Strengths

• Analytical & Problem Solving

• Teamwork & Collaboration

• Customer-Oriented Thinking

• Adaptability & Fast Learning

Tools

Git
Docker
Streamlit
Power BI
Jupyter Notebook

Languages

English – Professional

Finnish - Conversational