About Me

Innovative Data Scientist with expertise in implementing advanced AI solutions, including hierarchical text classification, large language models, and optimization algorithms, to drive efficiency and enhance decision-making.

Proven track record in demand forecasting, recommendation systems, and streamlining workflows, achieving significant operational improvements. Skilled in leveraging data-driven strategies to deliver impactful business insights and scalable solutions.

Stack: Python, SQL, BigQuery, Metabase, PyTorch, Airflow, GCP, AWS

Experience

Data Scientist

Tribun Network - KG Media
August 2024 - Present | Jakarta, Indonesia.
  • Researched and Implemented State of The Art Hierarchical Text classification (HiDEC 2024) using PyTorch to standardize news content management
  • Implemented large language models (LLMs) Qwen2.5 using Ollama for data conditioning and labeling, improving data quality
  • Enhanced the efficiency of an existing recommendation system, optimizing CPU usage from 100% to just 20% by utilizing a native vector database (Qdrant), significantly improving both efficiency and content delivery.
  • Streamlined the Entity Extraction pipeline using BERT model, reducing processing time from 2 hours daily to just 15 minutes

Data Scientist

Praktis.co
October 2022 - July 2024 | Jakarta, Indonesia.
  • Developed forecasting models using Vertex AI AutoML, for easy and fast development ML systems for simulated forecasting.
  • Implemented text similarity search using word embededding and cosine similarity to map product categories across online marketplaces.
  • Collaborated with academic experts to formulate optimization models (OR model) aimed at enhancing production scheduling efficiency implemented using Pyomo, and Lingo and also developed a heuristic algorithm in Python
  • Developed Min-Max inventory monitoring logic and visualized using Metabase to enhance inventory management and out-of-stock prevention
  • Pioneered the development of advanced demand forecasting models (XGBoost) using Python orchestrated by Airflow for batch inference, ensuring effective inventory management.

Data Analyst

Praktis.co
November 2021 - October 2022 | Jakarta, Indonesia.
  • Developed forecasting models for demand prediction for thousands of products at the SKU level
  • Cleaned and migrated reporting dashboards data using data warehouse achieved lower response time dashboards
  • Conducted Ad-hoc analysis

Data Science Trainee

Jakarta Smart City
August – October 2021 | Remote
  • Researched a Few-Shot Learning method to categorize CRM report images and improve model flexibility for new target classes without re-training.
  • Deployed prototype model for serving web app and prediction API.
  • Getting insight how big data can be implemented in the various sectors to improve the quality of life of the citizens

Geophysicist Research Intern

Pusat Penelitian dan Pengembangan Geologi Kelautan (PPPGL)
August – October 2019 | Bandung, Indonesia
  • Seismic Data Pre-processing, Processing and Imaging
  • Multiple attenuation using SRME and Parabolic Radon Transform
  • Post-Stack and Pre-Stack Kirchhoff Time Migration
  • Geological Structure Interpretation

Student Intern

PT TIMAH TBK
September – October 2018 | Pangkal Pinang, Indonesia
  • Data Acquisition of magnetic and geoelectrical survey and able to operate Proton Precision Magnetometer GMS-19T and Supersting R8
  • Data processing of magnetic and geoelectrical survey
  • Interpretation of geomagnetic map to delineate potential area of primary tin mineralization

Teaching Assistant

Universitas Jambi
January – April 2018 | Jambi, Indonesia
  • Teaching sophomore geophysical student about seismic refraction method
  • Teaching student how to determine seismic refraction acquisition survey design using 24 channels seismograph
  • Teaching student how to process seismic refraction data to get subsurface image

Education

  • University of Jambi

  • Bachelor of Engineering in Geophysical Engineering (Sarjana Teknik)
    Aug 2015 - Jan 2020 | Jambi, Indonesia
    Thesis: Seismic Pre-Stack and Post-Stack Kirchhoff Time Migration

Skills

  • Python
  • PyTorch
  • ArcGIS, Surfer, Global Mapper
  • BigQuery, SQL
  • Metabase, Data Studio
  • GCP,AWS
  • Microsoft Office (Excel, Word, Power Point)

Personal Projects

These are my personal project

Few-Shot Image Classification With JAKI Report Images

In this project, we aim to improve the JakLapor service using state-of-the-art image classification in the hopes it will be able to reduce manual categorization for each report and as a category recommendation for the end-user reports they are taking.

Random Forest and ANN Model with Oversampling Data

How these models performance compare to each others with oversampling data using SMOTE (Synthetic Minority Oversampling Technique). how we handle data like this? we can use oversampling technique with minority class in order to make real-like data and finally the data has the same distribution.

Drone Aerial View Segmentation

How to teach drone to see what is below and segment the object with high resolution. Teaching drone to see is quite challenges due to bird’s eye view and most of pre-trained models are trained in normal images we see (point of view) in daily basis (ImageNet, PASCAL VOC, COCO).

Semantic Segmentation in Seismic Images

This is semantic segmentation project to delineate salt bodies in seismic images with U-Net Architecture with Resnet18 and Resnet34. Salt can be a tricky process to image in seismic. To better interpret salt bodies in seismic, geologic information needs to be integrated in order to accurately image salt. Salt and sediment interactions can cause reservoir traps and hydrocarbon traps.

Mineral Image Classification

I classify 7 mineral classes and produce robust model that can be used to classify minerals, I've tried it using mineral images from Google.

Covid-19 Early Detection from X-ray Images Aided by AI

Covid-19 is an unprecedented situation for society and it’s going to change how human lives for the next generation. It’s also making medical testing has to run quickly in massive test samples, which human can be tired and the machine does not, in a situation like this AI really help.

Sentiment Analysis COVID-19

This Project about classify sentiment from scarapped twitter tweet using private data

Text Extraction

Save your reading time while machine extracts important sentences and make summary from PDF file, using TF-IDF as weight for each word in sentences and return the highes score as summary and save your time.

Combined Multiple Attenuation Methods and Geological Interpretation : Seram Sea Case Study 2D Marine Seismic Data

This research uses predictive deconvolution and FK-filter to attenuate short period multiple from their move out, then continued by SRME method to predict multiple that cannot be attenuated from previous method, then followed by Radon transform to attenuate multiple that still left and cannot be attenuated by SRME method. The result of each method then compared to each other to see how well multiple attenuated.