Data Professor

Exploring the intersection of data and AI by sharing insights through blog posts, YouTube videos, and research papers.

About Me

Get to know the person behind the content

Chanin Nantasenamat

Chanin Nantasenamat, PhD

Chanin is a Senior Developer Advocate at Snowflake Inc. where he creates educational content on the development of data apps with Streamlit and Snowflake. Prior to working in tech, he was a full Professor of Bioinformatics where he heavily relied on Python coding in conducting research and teaching bioinformatics.

On the side, he creates educational content in data science and AI on his YouTube channel called the Data Professor since 2019. Collectively, all of his educational videos had received more than 10 million views on YouTube.

Recently, he had launched a course on Fast Prototyping of GenAI Apps with Streamlit in collaboration with DeepLearning.AI × Snowflake.

Latest Blog Posts

Deep dives into technology trends, tutorials, and industry insights

Overview of Snowsight: Snowflake's Web Interface

Are you still jumping between multiple tools to manage your data? What if there was one unified platform to ingest, query, analyze and even build apps directly from your browser? Whether you're a data analyst, an engineer, or a business user, understanding Snowsight is crucial for maximizing your productivity and managing your data like a pro....

Developer's Guide to Modin and Snowflake Cortex

Have you ever wanted to supercharge your data processing with AI while working in a familiar pandas environment? Many data teams struggle with integrating advanced AI capabilities into their existing data workflows without compromising performance or requiring complex infrastructure changes. Snowflake Cortex provides a solution by enabling seamless integration of powerful AI capabilities directly into Modin DataFrames within the unified Snowflake ecosystem....

Developer's Guide to Snowflake Cortex Agent

Ever wondered how to make your enterprise data work smarter, not just harder? Snowflake is stepping up its game in enterprise AI with Snowflake Intelligence. You can think of it as a powerful platform that empowers developers to build clever data agents that can analyze, summarize, and even take action on your company's valuable data....

YouTube Videos

Video tutorials, tech reviews, and coding walkthroughs

Build an LLM-Powered Voice Agent in Python

In this video, you'll build a Do It Yourself LLM-powered voice agent in Python that you can chat to verbally. This voice agent retains memory throughout the conversation and performs both speech-to-text (transcription) and text-to-speech (speech synthesis). With this voice agent, you can chat with the LLM hands-free like you would with a friend! There's lots to customize here and make your own voice agent, watch the video and find out how!

Bioinformatics Project from Scratch PART 2 - Preparing the Data Set

In this video, you'll learn how to prepare and clean bioactivity data for the aromatase inhibitors in Python using the RDKit library. We'll cover the process of loading data from the CHEMBL database, removing duplicate molecules, and standardizing SMILES notations (a string representation of a molecule) in order to create a high-quality, non-redundant dataset suitable for machine learning model building and further data analysis.

Building a Call Center Analytics Pipeline in Python

In this video, you'll learn how to create an end-to-end call center analytics workflow using Assembly AI and Python. We'll walk you through the process of transcribing audio, identifying speakers, analyzing sentiment, and visualizing the results using heat maps. The tutorial covers key concepts such as API integration, natural language processing, and data visualization techniques to extract meaningful insights from call center conversations.

Research Papers

Academic contributions and technical research in emerging technologies

Building bioinformatics web applications with Streamlit
Cheminformatics, QSAR and Machine Learning Applications for Novel Drug Development2023

In recent years, we have witnessed an exponential growth in the generation of data in the biological sciences. To harness such big biological data, computational and machine learning pipelines have become instrumental for exploratory data analysis, predictive modeling, and data-driven decision-making. Open-source web development frameworks like Streamlit enable the scientific community to build and share web applications by using less code, thereby speeding up research and development. This chapter explores the usage of Streamlit for the development of software and tools in the field of bioinformatics.

Towards reproducible computational drug discovery
Journal of Cheminformatics2020

The reproducibility of experiments has been a long standing impediment for further scientific progress. Computational methods have been instrumental in drug discovery efforts owing to its multifaceted utilization for data collection, pre-processing, analysis and inference. This article provides an in-depth coverage on the reproducibility of computational drug discovery. This review explores the following topics: (1) the current state-of-the-art on reproducible research, (2) research documentation (e.g. electronic laboratory notebook, Jupyter notebook, etc.), (3) science of reproducible research (i.e. comparison and contrast with related concepts as replicability, reusability and reliability), ...

A practical overview of quantitative structure-activity relationship

Quantitative structure-activity relationship (QSAR) modeling pertains to the construction of predictive models of biological activities as a function of structural and molecular information of a compound library. The concept of QSAR has typically been used for drug discovery and development and has gained wide applicability for correlating molecular information with not only biological activities but also with other physicochemical properties, which has therefore been termed quantitative structure-property relationship (QSPR). Typical molecular parameters that are used to account for electronic properties, ...

Media Kit

Comprehensive audience demographics and reach statistics

Social Media Reach

213,000+
Subscribers
YouTube
29,000+
Followers
X/Twitter
12,000+
Followers
LinkedIn
9,000+
Followers
GitHub
2,500+
Subscribers
Newsletter
Top Geographies
Audience distribution by country (2025)
India30.4%
United States12.8%
United Kingdom2.9%
Pakistan2.2%
Germany2.1%
Age Demographics
Audience age distribution (2025)
18-24 years44.3%
25-34 years33.9%
35-44 years13.8%
45-54 years6%
55-64 years1.6%
65+ years0.3%
Gender Distribution
Audience gender breakdown (2025)
Male76.9%
Female23%

Newsletter

Subscribe to my newsletter for the latest insights on data science, AI, and tech tutorials delivered straight to your inbox.