Hi, I'm Nikhil Goutham

Data Scientist with expertise in Data Analysis, Machine Learning, Deep Learning and AI

Who is this guy?

Data Scientist with over 3 years of experience in building machine learning models, developing data pipelines, and extracting insights from complex datasets. Expertise in supervised and unsupervised learning, deep learning, and natural language processing.

Skilled in Python, SQL, and cloud-based data engineering solutions. Proven ability to design scalable AI models, optimize ETL workflows, and deploy data-driven solutions that enhance business decision-making.

Profile

Contents

Education

Master of Science in Data Science

New Jersey Institute of Technology (NJIT)

Jan'23 - Dec'24

GPA: 3.95

Specialized in advanced analytics, machine learning, and data engineering

Focused on developing scalable solutions for real-world data challenges

Applied AI/ML techniques to solve complex business problems

Relevant Coursework:

Big DataMachine LearningDeep LearningCloud ComputingData VisualizationData MiningStatisticsR

Bachelor of Technology in Mechanical Engineering

BML Munjal University, New Delhi, India

Aug'17-Aug'21

GPA: 3.5

Received academic scholarship for outstanding performance

Sports Coordinator, Hero Challenge Fest (Jan-Feb 2018)

Sports Representative Head, Banyan League (Jan-Feb 2019)

Key Achievements:

Managed logistics for multiple teams, overseeing transportation, deliveries, inventory, and supply chain processes

Collaborated with cross-functional stakeholders to optimize workflows and enhance team productivity

That One Excel Sheet

Now you might wonder, why the heck did I shift my career from mechanical to data?

Ngl, that's what my parents wondered too.

It all started with that one Excel sheet. That one regression equation.

During my internships, I worked on projects involving smart manufacturing and energy optimization, where I eventually had to use data to improve efficiency. That's when it clicked, the wonders data could do.

Like, if a simple regression problem on a freaking Excel sheet could impact the climate by reducing X% of energy consumption, I could only imagine what else I could do if I pursued this data path deeper.

So I shifted my focus into data and climate tech, which led me to pursue my master's in data science.

Mechanical to Data

Experience & Projects

Verizon Logo

Verizon Capstone Project

Advanced fault detection system using ML

XGBoostTableauML

Verizon Capstone Project

Led the development of an advanced fault detection system using XGBoost models. Processed and analyzed large-scale JSON logs for pattern recognition, and created comprehensive Tableau dashboards for real-time operational monitoring. Leveraged NJIT's Wulver High Performance Computing system for efficient processing of 50GB+ dataset, utilizing multiple nodes and GPU acceleration for enhanced computational performance.

XGBoostTableauSnowflakePythonML
Kansas City Crime Analysis

Kansas City Crime Analysis

Interactive crime data visualization and analysis

TableauData AnalysisVisualization

Kansas City Crime Analysis

Developed an interactive Tableau dashboard analyzing crime data from 2016-2022. Features include COVID-19 impact analysis, crime hotspot identification, and demographic trend analysis. Created comprehensive visualizations for law enforcement and city planning insights.

TableauData AnalysisVisualizationGISStatistics
R Web Scraping Project

Web Scraping with R

Automated data extraction from Genome Biology articles

RWeb ScrapingData Analysis

Web Scraping with R

Developed an automated web scraping solution using R to extract and analyze articles from Genome Biology. The tool collects comprehensive data including titles, authors, affiliations, publication dates, abstracts, and full text content, enabling efficient scientific literature analysis.

R ProgrammingrvestdplyrData MiningWeb Scraping
USA House Price Prediction

USA House Price Prediction

ML-powered real estate price prediction system

Machine LearningPythonRegression

USA House Price Prediction

Developed a comprehensive machine learning solution using multiple regression models (Random Forest, Gradient Boosting, Ridge CV, ElasticNet CV) to predict U.S. house prices. Analyzed key variables including bedrooms, bathrooms, size, and location to extract patterns for accurate price predictions in real estate applications.

Random ForestGradient BoostingRidge CVElasticNet CVFeature Engineering
Parkinson's Disease Progression Prediction

Parkinson's Disease Prediction

Time series forecasting for disease progression

Time SeriesARIMAHealthcare

Parkinson's Disease Prediction

Developed a predictive model for Parkinson's disease progression using time series forecasting with ARIMA models. Analyzed peptide abundance, protein expression, and clinical data to predict UPDRS scores. Implemented comprehensive data preprocessing and feature engineering for enhanced prediction accuracy.

Time Series AnalysisARIMA ModelsData PreprocessingFeature EngineeringHealthcare Analytics
Library Database Management System

Library Management System

Full-stack library database system with GUI

PythonSQLiteTkinter

Library Management System

Developed a comprehensive library management system with a user-friendly GUI using Python and Tkinter. Features include document checkout/return, fine computation, reader management, and advanced search capabilities. Implemented robust database operations using SQLite for efficient data management and retrieval.

PythonSQLiteTkinterGUI DevelopmentDatabase Design
Game of Life: Wormhole

Game of Life: Wormhole

Advanced cellular automata simulation with wormhole tunnels

PythonSimulationGame of Life

Game of Life: Wormhole

An advanced simulation of Conway's Game of Life featuring "wormhole" tunnels that connect different parts of the grid, enabling unique cellular automata behaviors. Built in Python, with visualizations and edge case explorations.

PythonCellular AutomataVisualizationEdge Cases
Energy Optimization

Energy Optimization for Pharma Labs

Led energy optimization projects for pharmaceutical laboratories, reducing HVAC energy consumption by 15%-23% by analyzing complex datasets, identifying trends, and forecasting energy requirements.

ARIMAEnsemble ModelsTableauExcelWeather Prediction

Energy Optimization for Pharma Labs

  • Increased energy demand forecasting accuracy by testing and deploying ARIMA and ensemble models for predictive analytics.
  • Improved data-driven decision-making by designing interactive Tableau dashboards, allowing executives to monitor key operational trends.
  • Streamlined financial operations by developing automated Excel tools for billing and client solutions.
  • Developed weather prediction models using machine learning and time series techniques, enhancing energy forecasting.
ARIMAEnsemble ModelsTableauExcelWeather Prediction

Skills & Technologies

ML/AI

Machine Learning & AI

TensorFlow, PyTorch, Scikit-learn, NLP, Computer Vision, MLOps

Data Engineering

Data Engineering

SQL, NoSQL, Apache Spark, Hadoop, Airflow, ETL, Snowflake, Databricks

Cloud/DevOps

Cloud & DevOps

AWS, Docker, Kubernetes, CI/CD, Terraform, GCP, Azure

Programming

Programming

Python, R, SQL, Shell Scripting, Git

Data Analysis

Data Analysis

Statistical Analysis, Data Visualization, Tableau, Power BI, A/B Testing, Time Series

Soft Skills

Soft Skills

Problem Solving, Communication, Team Leadership, Project Management, Agile

Data Meets Climate

I've been trying to pivot into climate-focused work. I'd love to explore if there might be any data-related roles or upcoming needs, happy to contribute in any capacity. I'm eager to learn and would love to explore new domains

Slide 1

Want to see my resume?

Plot twist: My resume is like a tech startup - constantly iterating and shipping new features!

Between you and me, I'm learning faster than my printer can keep up with! Drop me a line for the latest version - it might have changed while you were reading this! 😄

Request Latest Build v2025-07-16 🎮

Get in Touch

Have a question or want to work together? I'd love to hear from you.

Find Me Here

Future Goals & Interests

Climate Tech

Passionate about leveraging data science for climate action and sustainability. Experienced in energy optimization projects that reduced consumption by 15-23%. Seeking opportunities in climate tech and environmental data science.

AI/ML Advancement

Continuously exploring cutting-edge machine learning techniques, deep learning architectures, and emerging AI technologies. Focused on developing scalable, ethical AI solutions.

Data Engineering

Building robust, scalable data pipelines and infrastructure. Expertise in cloud platforms, real-time processing, and data architecture design for enterprise solutions.

Healthcare Analytics

Applying data science to healthcare challenges, from disease prediction to patient outcome analysis. Committed to improving healthcare through data-driven insights and predictive modeling.

Open Source

Contributing to the data science community through open source projects, knowledge sharing, and mentorship. Building tools and libraries that help others solve complex data challenges.

Mentorship

Passionate about helping others grow in data science and technology. Offering guidance, sharing knowledge, and supporting the next generation of data professionals and researchers.