Skills

Cloud Native

  • Docker
  • Kubernetes
  • Grafana
  • Telegraf
  • InfluxDB

Data Science / Machine Learning

  • TensorFlow
  • PyTorch
  • Keras
  • XGBoost
  • LightGBM
  • Scikit-learn
  • Pandas
  • NumPy
  • SciPy

Natural Language Processing (NLP)

  • BERT
  • GPT
  • NLTK
  • SpaCy
  • Transformers
  • Word2Vec
  • Text Preprocessing
  • Sentiment Analysis
  • Named Entity Recognition (NER)

Network Analysis

  • Graph Neural Networks
  • NetworkX
  • Gephi
  • PyGraphistry
  • Graph Databases (Neo4j, TigerGraph)
  • Network Path Anlaysis/Modelling
  • Coordination Detection

Data Engineering

  • Apache Spark
  • Apache Kafka
  • Apache Parquet
  • Apache Airflow
  • Databricks
  • Data Lakes (Delta Lake)
  • ETL Pipelines

Cloud Platforms

  • AWS (S3, EC2)
  • Google Cloud Platform (GCP)
  • Microsoft Azure
  • IBM Cloud

Database Management

  • SQL
  • NoSQL (MongoDB, Cassandra)
  • BigQuery
  • PostgreSQL
  • MySQL
  • Redis

Visualization & Dashboarding

  • Tableau
  • PowerBI
  • Matplotlib
  • Seaborn
  • Plotly
  • Dash by Plotly
  • Streamlit

Version Control & Collaboration

  • Git
  • GitHub
  • GitLab
  • Bitbucket
  • Docker
  • Jupyter Notebooks

Statistical Analysis

  • Hypothesis Testing
  • Traditional Statistics Methods
  • Bayesian Methods
  • Time Series Analysis

Machine Learning Operations (MLOps)

  • MLflow
  • Kubeflow
  • TFX (TensorFlow Extended)
  • PyToch
  • Model Monitoring
  • Model Deployment

Work Experience (5)

Jan 2019 - Current
Chief Data Scientist
Durian Corp
https://www.duriancorp.com
  • Responsible for the design and implementation of data platform and solutions

  • Conducted regular training sessions for the data science team and clients on digital transformation and data science

  • Lead a team of 12 developers, data scientists, data engineers with focus on network analysis, machine learning, and natural language processing.

  • Provide business-critical, custom-designed data solutions for clients across various industries including finance, healthcare, retail, and pharma.

  • Work with researchers from NGOs, government, and academia to develop and implement data-driven solutions for social change and social good!

Jun 2018 - Current
Chief Technology Officer
EdTech (Thailand) Co. Ltd.
  • Responsible for databasing speech data, creating statistically-driven speech models, convolutional neural network training, and data acquisition.

  • Oversaw content and curriculum development in English with a team of 7 content creators.

  • Maintained a growing database of speech data for model training.

May 2017 - Current
General Manager & Co-Owner
California Degree Co. Ltd
  • Oversaw Databasing, Customer Regression Models, Projection, and Forecasting.

  • Responsible for customer evaluation and placement assessments.

  • Developed highly tailored testing and learning materials.

Sep 2010 - Jun 2012
Research Assistant
The Ohio State University
  • Conducted research on the syntax and semantics of Mushunguli, a Bantu language spoken in Somlia.

  • Developed a novel method for representing complex gender agreement in languages with multi-class noun classes.

Sep 2011 - Jun 2012
Research Assistant
Massachusets Institute of Technology
  • Conducted research on the lens filters for large CNN models (neural networks)

  • Worked with a multi-university team of researchers on CMUSphinx, an open source speech recogniton software.

Volunteer

Sep 2010 - Jun 2013
Level III Master Tutor
Student Athlete Support Services
Jul 2011 - Jun 2013
Head Linguistics Tutor and Ersatz Professor/Lecturer
The Ohio State University
Sep 2010 - Jun 2013
German Language Teacher/Tutor
Ohio German Language School

Education (5)

- 2011
B.S.
Computational Linguistics
The Ohio State University
- 2011
B.A.
German
The Ohio State University
- 2011
B.S.
Molecular Genetics
The Ohio State University
- 2012
M.Sc.
Computational Linguistics (NLP)
Massachusetts Institute of Technology
- 2011
M.Sc.
Molecular Genetics
The Ohio State University

Certificates

2024
IBM Data Science Professional Certificate
IBM
2023
Data Science with Python Expert
IBM
Advanced Learning Algorthms
Stanford University

Publications

Concurrent Filtering and Smoothing: A Parallel Architecture for Real-Time Navigation and Full Smoothing
Gender Conflict Resolution in Mushunguli

Languages

English

Native or bilingual proficiency

German

Native or bilingual proficiency

Thai

Near Fluent

Interests

Areas of Research

  • Semnatic Embeddings
  • RAG Optimization
  • Influence Operation Identification
  • Temporal Point Processing & GMM

Linguistics

  • Syntax
  • Semantics
  • Language Acquisition

German Literature

  • Romantic Literature
  • Local Dialects

Molecular Genetics

  • Genetic Disorders
  • Statistical Genetics

Humanities

  • Anthropology
  • Linguistics
  • Political Science
  • Gender & Identity Theory

Technology

  • Data Science
  • DevOps
  • Scaling & Optimization