I'm currently working as a Sr. Solutions Consultant at Databricks, helping clients tackle some of the toughest data challenges out there. Previously, I worked as a Research Scientist at The Vera C. Rubin Observatory - Chile and earned an MS in Data Science from The University of Washington - Seattle. I also worked as a Data & Machine Learning Engineer at Shell, where I built and deployed advanced data-driven products that helped traders generate $500+ Million/year in revenue. With hands-on experience taking ideas to production in both startups and large companies, I have a strong foundation in data science, machine learning, data engineering, and operations. Let’s connect and create something amazing together!
Subjects: Introduction to Statistics and Probability, Data Visualization, Software Design, Applied Statistics and Experimental Design, Data Management, Statistical Machine Learning, Human-Centered Data Science, Scalable Data Systems and Algorithms.
Co-curricular: Organizer at The RAISE Group, Graduate Research Assistant at the DiRAC Institute, Capstone with Virufy.
Subjects: Data Structures & Algorithms, Object-Oriented Programming Methodology, Big Data, Open-Source Technologies, Soft Computing, Database Management System, Software Engineering, Data Mining & Business Intelligence, Distributed Systems, Cloud Computing, Software Project Management, Intelligent System.
Co-curricular: Co-founder of the Coders' Club.
Concepts & Technologies
Data Science, Data Engineering, Machine Learning, Natural Language Processing (NLP), Computer/Machine Vision, MLOps (Machine Learning Operations), ETL (Extract-Transform-Load), Data Visualization, A/B Testing, Data Modeling, Database Management, Data Analysis, Data Wrangling, Data Warehousing, RAG.
Programming & Scripting Languages
Regular: Python, SQL, JavaScript, HTML, CSS
Past Experience: C, C#, Java, PHP
Tools & Frameworks
Data Engineering: Databricks, Apache Spark, Git, Microsoft Azure, AWS, GCP, Microsoft SQL Server, Alteryx, Apache Airflow, Docker.
Data Science: Microsoft Azure ML Studio, HuggingFace, PyTorch, Scikit-learn, TensorFlow, Pandas, NumPy, Matplotlib, OpenCV, Tableau, Flask, Keras, NLTK, FastAPI, Streamlit, Seaborn, LangChain, Vector Database (Pinecone).