Denys Katerenchuk

Lorem ipsum dolor sit amet, consectetur adipiscing elit. Proin sit amet vulputate augue. Duis auctor lacus id vehicula gravida. Nam suscipit vitae purus et laoreet. Donec nisi dolor, consequat vel pretium id, auctor in dui. Nam iaculis, neque ac ullamcorper.

Posted by Denys Katerenchuk

Building KAI-GPT v4: What I Learned About Domain-Specialized LLMs

July 15, 2025
Posted by Denys Katerenchuk
LLM, Deep Learning

Working on large language models for the past few years has taught me one crucial lesson: general-purpose models, no matter how capable, hit a wall when they encounter specialized domains like banking. At Kasisto, I’ve been driving development of KAI-GPT v4, and the journey has reinforced why domain expertise matters more than raw scale.

Building KAI-GPT v4: What I Learned About Domain-Specialized LLMs

July 15, 2025
Posted by Denys Katerenchuk
LLM, Deep Learning

You Should Probably Read This: Hedge Detection

January 1, 2022
Posted by Denys Katerenchuk
Natural Language Processing, Hedge Detection, Neural Networks

Imagine a doctor says “The patient likely has pneumonia” versus “The patient has pneumonia.” That single word changes everything. The first statement hedges the diagnosis, expressing uncertainty that could affect treatment decisions. The second makes a definitive claim.

You Should Probably Read This: Hedge Detection

January 1, 2022
Posted by Denys Katerenchuk
Natural Language Processing, Hedge Detection, Neural Networks

RankDCG: Ranking-Ordering Evaluation Measure

February 1, 2017
Posted by Denys Katerenchuk
RankDCG, Evaluation Metrics, Information Retrieval, Ranking Algorithms

Advanced algorithms improve our lives in subtle, but consistent ways. Google search results seldom make us go to the second page and Netflix magically recommends an interesting movie. These are outcomes of multiple trials and errors to find the best recommendation algorithm. How do these companies determine which algorithm is the best? Evaluating different ranking algorithms is not a trivial task. This is why we developed the Rank Discounter Cumulative Gain (RankDCG) evaluation measure.

RankDCG: Ranking-Ordering Evaluation Measure

February 1, 2017
Posted by Denys Katerenchuk
RankDCG, Evaluation Metrics, Information Retrieval, Ranking Algorithms

Budget Friendly Deep Learning Home PC

April 22, 2016
Posted by Denys Katerenchuk
Deep Learning PC

For over ten years, my main work tool has been a laptop. For most tasks — whether it is software development or research — laptops deliver good performance with the advantage of working from anywhere. More demanding work gets done on remote servers or cloud computing platforms. In my lab, I set up a Sun Grid Engine cluster of six servers that brings together the power of 36 Xeon CPUs. This system was used to build Automatic Speech Recognition (ASR) models (which took over a month to train) and to run many other extensive experiments. My laptop simply couldn’t compete with it. However, in recent years with the development of deep learning, GPUs have become incredibly powerful tools in research. For example, when I needed to run the same ASR training, a single GPU reduced training time from over 30 days to around 6.

Budget Friendly Deep Learning Home PC

April 22, 2016
Posted by Denys Katerenchuk
Deep Learning PC

Predicting Stock Prices: What Does Twitter Know About the Market?

May 29, 2015
Posted by Denys Katerenchuk
Stock Market Prediction, Twitter Analysis, Machine Learning, Finance, Social Media Analytics

Introduction

Can public opinion predict stock market movements? With recent advancements in natural language processing (NLP), this question has gained significant attention in both academic and financial circles. As an NLP scientist, I decided to investigate this relationship and satisfy my own curiosity about the intersection of social media sentiment and financial markets.

Predicting Stock Prices: What Does Twitter Know About the Market?

May 29, 2015
Posted by Denys Katerenchuk
Stock Market Prediction, Twitter Analysis, Machine Learning, Finance, Social Media Analytics

Introduction

War Story: Text Analysis – Looking for Questions in Pickup Dating Community

Abstract

This study presents a comprehensive text analysis of the pickup dating community through the lens of social network analysis, natural language processing, and community psychology. Working alongside Anders Wallace, an Anthropology Ph.D. student, we analyzed forum data from fastseduction.com to understand community structure, social hierarchies, and linguistic patterns. Our findings reveal complex social dynamics, educational diversity, and hierarchical structures that challenge common stereotypes about this community.

War Story: Text Analysis – Looking for Questions in Pickup Dating Community

Abstract

Sentiment Analysis of Manhattan Using Instagram

[Note: Unfortunately, most images were lost due to changes in carto.com service.]

Sentiment Analysis of Manhattan Using Instagram

[Note: Unfortunately, most images were lost due to changes in carto.com service.]

NYC Data Visualization

Introduction

In this blog post, we will explore various data visualization techniques using datasets from New York City. Our primary focus will be on analytics related to Broadway shows and how we can extract meaningful insights from the data.

NYC Data Visualization

Introduction

Interspeech2014 @ Singapore – Highlights

October 12, 2014
Posted by Denys Katerenchuk
Conference

This year my colleagues from SpeechLab and I were invited to Interspeech2014 conference to present our work. The conference was held in Singapore, a great city with multilingual culture. The city was the perfect choice for an NLP and speech conference. I was impressed by how well everything was organized from the opening ceremony to the very end. It made the 24-hour flight from NYC to Singapore well worth it. The best part of the conference was the whole slew of interesting research papers in various areas of NLP.

Interspeech2014 @ Singapore – Highlights

October 12, 2014
Posted by Denys Katerenchuk
Conference