Hi, I'm Nikhil Goutham

Data Scientist with expertise in Data Analysis, Machine Learning, Deep Learning and AI

About Me

Data Scientist with over 3 years of experience in building machine learning models, developing data pipelines, and extracting insights from complex datasets. Expertise in supervised and unsupervised learning, deep learning, and natural language processing.

Skilled in Python, SQL, and cloud-based data engineering solutions. Proven ability to design scalable AI models, optimize ETL workflows, and deploy data-driven solutions that enhance business decision-making.

Nikhil Goutham

Education

Master of Science in Data Science

New Jersey Institute of Technology (NJIT)

Graduated: December 2024

GPA: 3.95

• Specialized in advanced analytics, machine learning, and data engineering

• Focused on developing scalable solutions for real-world data challenges

• Applied AI/ML techniques to solve complex business problems

Relevant Coursework:

Big DataMachine LearningDeep LearningCloud ComputingData VisualizationData MiningStatisticsR

Bachelor of Technology in Mechanical Engineering

BML Munjal University, New Delhi, India

August 2017 - August 2021

GPA: 3.5

• Received academic scholarship for outstanding performance

• Sports Coordinator, Hero Challenge Fest (Jan-Feb 2018)

• Sports Representative Head, Banyan League (Jan-Feb 2019)

- Managed logistics for multiple teams, overseeing transportation, deliveries, inventory, and supply chain processes

- Collaborated with cross-functional stakeholders to optimize workflows and enhance team productivity

Experience & Projects

Verizon Logo

Verizon Capstone Project

Advanced fault detection system using ML

XGBoostTableauML

Verizon Capstone Project

Led the development of an advanced fault detection system using XGBoost models. Processed and analyzed large-scale JSON logs for pattern recognition, and created comprehensive Tableau dashboards for real-time operational monitoring. Leveraged NJIT's Wulver High Performance Computing system for efficient processing of 50GB+ dataset, utilizing multiple nodes and GPU acceleration for enhanced computational performance.

XGBoostTableauSnowflakePythonML
Kansas City Crime Analysis

Kansas City Crime Analysis

Interactive crime data visualization and analysis

TableauData AnalysisVisualization

Kansas City Crime Analysis

Developed an interactive Tableau dashboard analyzing crime data from 2016-2022. Features include COVID-19 impact analysis, crime hotspot identification, and demographic trend analysis. Created comprehensive visualizations for law enforcement and city planning insights.

TableauData AnalysisVisualizationGISStatistics
R Web Scraping Project

Web Scraping with R

Automated data extraction from Genome Biology articles

RWeb ScrapingData Analysis

Web Scraping with R

Developed an automated web scraping solution using R to extract and analyze articles from Genome Biology. The tool collects comprehensive data including titles, authors, affiliations, publication dates, abstracts, and full text content, enabling efficient scientific literature analysis.

R ProgrammingrvestdplyrData MiningWeb Scraping
USA House Price Prediction

USA House Price Prediction

ML-powered real estate price prediction system

Machine LearningPythonRegression

USA House Price Prediction

Developed a comprehensive machine learning solution using multiple regression models (Random Forest, Gradient Boosting, Ridge CV, ElasticNet CV) to predict U.S. house prices. Analyzed key variables including bedrooms, bathrooms, size, and location to extract patterns for accurate price predictions in real estate applications.

Random ForestGradient BoostingRidge CVElasticNet CVFeature Engineering
Parkinson's Disease Progression Prediction

Parkinson's Disease Prediction

Time series forecasting for disease progression

Time SeriesARIMAHealthcare

Parkinson's Disease Prediction

Developed a predictive model for Parkinson's disease progression using time series forecasting with ARIMA models. Analyzed peptide abundance, protein expression, and clinical data to predict UPDRS scores. Implemented comprehensive data preprocessing and feature engineering for enhanced prediction accuracy.

Time Series AnalysisARIMA ModelsData PreprocessingFeature EngineeringHealthcare Analytics
Library Database Management System

Library Management System

Full-stack library database system with GUI

PythonSQLiteTkinter

Library Management System

Developed a comprehensive library management system with a user-friendly GUI using Python and Tkinter. Features include document checkout/return, fine computation, reader management, and advanced search capabilities. Implemented robust database operations using SQLite for efficient data management and retrieval.

PythonSQLiteTkinterGUI DevelopmentDatabase Design

Skills & Technologies

Programming & ML

  • Python
  • R
  • SQL
  • TensorFlow
  • PyTorch
  • Scikit-Learn

Data Engineering

  • Apache Airflow
  • Snowflake
  • Databricks
  • Apache Spark
  • Hadoop
  • ETL Pipelines

Cloud & DevOps

  • AWS
  • Google Cloud
  • Azure
  • Docker
  • Kubernetes
  • CI/CD Pipelines

Data Analysis

  • Tableau
  • Power BI
  • A/B Testing
  • Statistical Analysis
  • Data Visualization
  • Time Series Analysis

Want to see my resume?

Plot twist: My resume is like a tech startup - constantly iterating and shipping new features!

Between you and me, I'm learning faster than my printer can keep up with! Drop me a line for the latest version - it might have changed while you were reading this! 😄

Request Latest Build v2025-06-06 🎮

Get in Touch

Have a question or want to work together? I'd love to hear from you.