Hi, I'm Nikhil Goutham
Data Scientist with expertise in Data Analysis, Machine Learning, Deep Learning and AI
About Me
Data Scientist with over 3 years of experience in building machine learning models, developing data pipelines, and extracting insights from complex datasets. Expertise in supervised and unsupervised learning, deep learning, and natural language processing.
Skilled in Python, SQL, and cloud-based data engineering solutions. Proven ability to design scalable AI models, optimize ETL workflows, and deploy data-driven solutions that enhance business decision-making.

Education
Master of Science in Data Science
New Jersey Institute of Technology (NJIT)
Graduated: December 2024
• Specialized in advanced analytics, machine learning, and data engineering
• Focused on developing scalable solutions for real-world data challenges
• Applied AI/ML techniques to solve complex business problems
Relevant Coursework:
Bachelor of Technology in Mechanical Engineering
BML Munjal University, New Delhi, India
August 2017 - August 2021
• Received academic scholarship for outstanding performance
• Sports Coordinator, Hero Challenge Fest (Jan-Feb 2018)
• Sports Representative Head, Banyan League (Jan-Feb 2019)
- Managed logistics for multiple teams, overseeing transportation, deliveries, inventory, and supply chain processes
- Collaborated with cross-functional stakeholders to optimize workflows and enhance team productivity
Experience & Projects

Verizon Capstone Project
Advanced fault detection system using ML
Verizon Capstone Project
Led the development of an advanced fault detection system using XGBoost models. Processed and analyzed large-scale JSON logs for pattern recognition, and created comprehensive Tableau dashboards for real-time operational monitoring. Leveraged NJIT's Wulver High Performance Computing system for efficient processing of 50GB+ dataset, utilizing multiple nodes and GPU acceleration for enhanced computational performance.

Kansas City Crime Analysis
Interactive crime data visualization and analysis
Kansas City Crime Analysis
Developed an interactive Tableau dashboard analyzing crime data from 2016-2022. Features include COVID-19 impact analysis, crime hotspot identification, and demographic trend analysis. Created comprehensive visualizations for law enforcement and city planning insights.

Web Scraping with R
Automated data extraction from Genome Biology articles
Web Scraping with R
Developed an automated web scraping solution using R to extract and analyze articles from Genome Biology. The tool collects comprehensive data including titles, authors, affiliations, publication dates, abstracts, and full text content, enabling efficient scientific literature analysis.

USA House Price Prediction
ML-powered real estate price prediction system
USA House Price Prediction
Developed a comprehensive machine learning solution using multiple regression models (Random Forest, Gradient Boosting, Ridge CV, ElasticNet CV) to predict U.S. house prices. Analyzed key variables including bedrooms, bathrooms, size, and location to extract patterns for accurate price predictions in real estate applications.

Parkinson's Disease Prediction
Time series forecasting for disease progression
Parkinson's Disease Prediction
Developed a predictive model for Parkinson's disease progression using time series forecasting with ARIMA models. Analyzed peptide abundance, protein expression, and clinical data to predict UPDRS scores. Implemented comprehensive data preprocessing and feature engineering for enhanced prediction accuracy.

Library Management System
Full-stack library database system with GUI
Library Management System
Developed a comprehensive library management system with a user-friendly GUI using Python and Tkinter. Features include document checkout/return, fine computation, reader management, and advanced search capabilities. Implemented robust database operations using SQLite for efficient data management and retrieval.
Skills & Technologies
Programming & ML
- Python
- R
- SQL
- TensorFlow
- PyTorch
- Scikit-Learn
Data Engineering
- Apache Airflow
- Snowflake
- Databricks
- Apache Spark
- Hadoop
- ETL Pipelines
Cloud & DevOps
- AWS
- Google Cloud
- Azure
- Docker
- Kubernetes
- CI/CD Pipelines
Data Analysis
- Tableau
- Power BI
- A/B Testing
- Statistical Analysis
- Data Visualization
- Time Series Analysis
Want to see my resume?
Between you and me, I'm learning faster than my printer can keep up with! Drop me a line for the latest version - it might have changed while you were reading this! 😄
Request Latest Build v2025-06-06 🎮Get in Touch
Have a question or want to work together? I'd love to hear from you.