EXPERIENCE
Machine Learning Research Engineer
(Oct 2024 - present)
At Rutgers New Jersey Medical School, I've been developing advanced prediction models focused on molecular inhibition for drug discovery applications. My work combines traditional machine learning approaches with deep learning architectures to enhance pharmaceutical research capabilities.
Key Contributions:
Engineered prediction models for molecular inhibition by leveraging RDKit to generate comprehensive molecular descriptors, significantly enhancing feature selection for downstream modeling tasks
Built a machine learning pipeline integrating SVC, Random Forest, and XGBoost models tuned with Optuna, achieving a ROC-AUC of 0.78 under k-fold cross-validation
Implemented LSTM-based recurrent networks with attention mechanisms for sequential modeling of molecular structures, increasing model sensitivity to subtle molecular properties
Developed and fine-tuned a Directed Message Passing Neural Network (D-MPNN) using the Chemprop framework, reaching an F1 score of 0.87 and a PR-AUC of 0.82, directly contributing to early-stage drug discovery processes
Integrated GPT, Claude, and the domain-specific large language model BioMedLM to analyze research literature and extract relevant molecular binding properties, accelerating the identification of promising inhibitor candidates by 40%
Technologies:
Python, PyTorch, RDKit, Optuna, HyperOpt, Ray Tune, DeepChem, Scikit-learn, Pandas, NumPy, Matplotlib, Seaborn, Jupyter Notebooks, Git, Docker, GPT, Claude, BioMedLM, Transformers, Chemprop, DGL (Deep Graph Library), Weights & Biases, Linux


Lead Developer
(Oct 2023 - May 2024)
At The Daily Targum, Rutgers University's independent student newspaper, I led technical initiatives to enhance website performance, user experience, and backend infrastructure while providing technical leadership to a team of developers.
Key Contributions:
Improved website runtime performance by 35% through strategic file migration to DynamoDB and S3 parallel transfer with optimized indexing, improving the experience for over 50,000 monthly visitors
Led a team of 4 in a comprehensive front-end redesign using HTML/CSS and JavaScript, boosting SEO ranking by 15% and ad engagement by 20%, directly contributing to increased revenue
Reduced costs by 50% by evaluating CMS options and communicating integration insights to stakeholders, streamlining development processes and improving operational efficiency
Implemented a data analytics pipeline using Google Analytics and custom Python scripts to track user engagement patterns, providing actionable insights that increased article read-through rates by 22%
Developed an automated content recommendation system using collaborative filtering and TF-IDF analysis to personalize article suggestions, increasing user session duration by 40% and reducing bounce rates by 18%
Deployed GPT-3.5 for headline optimization through an A/B testing framework that compared ML-generated headlines against human-written ones, resulting in 27% higher click-through rates for optimized content
Technologies:
HTML5, CSS3, JavaScript, jQuery, ReactJS, AWS (S3, DynamoDB, EC2, CloudFront), Node.js, Git, GitHub Actions, WordPress, Google Analytics, Python, Pandas, Scikit-learn, TensorFlow, OpenAI API, SQL, Docker, Netlify, Vercel, CI/CD Pipeline


Data Scientist (Graduate Research Assistant - Statistics Department)
(Jan 2023 - July 2023)
As a Graduate Research Assistant in the Statistics Department, I focused on making advanced statistical models accessible to researchers across disciplines while enhancing methodological approaches for improved prediction accuracy.
Key Contributions:
Simplified advanced statistical models for non-technical users by converting Python models to Stata functions, improving accessibility and adoption among social science researchers
Achieved 0.95 accuracy for Random Forest with cross-validation through Hat Matrix optimization for n-dimensional array predictions, establishing a new departmental benchmark for model performance
Improved causal inference methodologies by incorporating Two-Stage Curvature Identification, which raised algorithm pipeline accuracy by 15% across multiple research datasets
Developed novel time series forecasting techniques combining SARIMA models with feature engineering approaches, reducing prediction error by 23% compared to standard implementations
Created a natural language interface using LangChain and Llama 2, enabling researchers to query statistical results through conversational prompts, reducing analysis time by 65% for non-programmers
Technologies:
Python, R, Stata, SQL, Pandas, NumPy, Scikit-learn, StatsModels, TensorFlow, PyTorch, Matplotlib, Seaborn, Git, Jupyter Notebooks, LangChain, Llama 2, HuggingFace, AWS (EC2, S3), Docker, LaTeX, Tableau


Machine Learning Engineer (Global Business Services)
(June 2021 - August 2021)
At Siemens, I focused on implementing automation solutions and leveraging NLP techniques to improve business processes and customer feedback analysis for global marketing initiatives.
Key Contributions:
Automated the installation and update of 5,000+ driver files for Windows desktops by building a custom Bash script to run on boot, reducing IT support tickets by 73% and saving approximately 120 work hours monthly
Achieved a Silhouette Score of 0.89 for clusters of consumer comments, derived from fine-tuned SentenceBERT models in Python, enabling precise market segmentation and targeted campaign development
Raised the Silhouette Score to 0.94 by incorporating Siamese network structures as part of a global effort to streamline marketing using customer feedback, improving product development prioritization
Engineered a time series analysis framework using ARIMA and Prophet to forecast consumer trends from historical feedback data, producing 6-month projections with 85% accuracy for product planning
Implemented GPT-J fine-tuned on company documentation to automatically generate comprehensive responses to customer support inquiries, reducing response time by 60% while maintaining 92% accuracy
Technologies:
Python, Bash, PowerShell, TensorFlow, PyTorch, Sentence-Transformers, HuggingFace, Scikit-learn, Pandas, NumPy, Matplotlib, Plotly, GPT-J, FAISS, ElasticSearch, Git, Docker, Jenkins, AWS (EC2, S3, Lambda), Linux, Windows Server, JIRA


Fullstack Web Developer / Analyst
(June 2020 - December 2020)
At Kaiser Exports, a growing logistics company that wanted a stronger online presence and a backend portal for easier access, I helped create internal systems under the guidance of a senior developer, who helped me gain more insight into development.
Key Contributions:
Created a comprehensive internal system for tracking inventory and materials in transport, enhancing order management processes and reducing processing time by 27%
Designed and built a new website from scratch using HTML, CSS, JavaScript, PHP, and SQL to establish the company's digital presence, improving client engagement and accessibility
Developed an automated reporting system using Python and basic ML classification techniques to categorize shipping manifests, reducing manual document processing time by 40%
Implemented database querying and modification tools for efficient inventory access, integrating simple predictive models to anticipate stock requirements based on historical patterns
Established proper documentation for all systems, enabling faster onboarding of new employees and ensuring knowledge transfer across the organization
Technologies:
HTML5, CSS3, JavaScript, PHP, MySQL, Python, Pandas, Scikit-learn, Git, Bootstrap, jQuery, Apache, Jupyter Notebooks, Microsoft Office Suite

