Kim 2 ML

Applied AI notes for data scientists, analysts, and builders

Blog

Research breakdowns, practical implementation notes, and opinionated takes from real-world data and AI work.

Coverage – Aware Web Crawling for Domain – Specific Supplier Discovery via a Web – Knowledge – Web Pipeline

March 21, 2026

Paper

ref: https://arxiv.org/pdf/2602.24262 Summary Most knowledge graphs fail in production because they are incomplete and hard to maintain. This paper flips that weakness…
Read breakdown
How to Train Your Deep Research Agent? Prompt, Reward, and Policy Optimization in Search-R1

March 1, 2026

Paper

ref: https://arxiv.org/pdf/2602.19526v1 Summary I used to think longer prompts and step-by-step reasoning always make AI smarter. In daily use, that often helps.…
Read breakdown
Prompt Repetition Improves Non-Reasoning LLMs

February 22, 2026

Paper

ref: https://arxiv.org/pdf/2512.14982 Summary The easiest AI upgrade might be just “copy and paste”. Google researchers tested with 7 leading models and found…
Read breakdown
Verbalized Sampling: How to mitigate Mode Collapse and unlock LLM diversity

February 9, 2026

Paper

ref: https://arxiv.org/pdf/2510.01171 Summary AI models repeat safe answers, but a simple prompt tweak can unlock far better thinking. Researchers introduced “verbalized sampling,”…
Read breakdown
Tool-MAD: A Multi-Agent Debate Framework for Fact Verification with Diverse Tool Augmentation and Adaptive Retrieval

February 2, 2026

Paper

ref: https://arxiv.org/pdf/2601.04742 Summary The team studied several methods to improve LLM accuracy and combined the strongest ideas to build Tool-MAD. The system…
Read breakdown
The Next Conversation

February 1, 2026

Book

Understanding is a superpower. Liking is optional. Jefferson Fisher teaches a simple but powerful idea: Argue less. Talk more The book changed…
Read breakdown
Small or Large? Zero-Shot or Finetuned? Guiding Language Model Choice for Specialized Applications in Healthcare

January 25, 2026

Paper

ref: https://arxiv.org/pdf/2504.21191 Summary The paper shows that finetuning still matters more than model size for well-defined tasks. The authors compared focused models…
Read breakdown
SLMFix: Leveraging Small Language Models for error fixing with Reinforcement Learning

January 18, 2026

Paper

ref: https://arxiv.org/pdf/2511.19422 Summary The paper suggest to train small language model (SLM) repair code for least known programming languages. The report over…
Read breakdown
Agint: Agentic Graph Compilation for Software Engineering Agents

January 11, 2026

Paper

ref: https://arxiv.org/pdf/2509.00625 webpage: https://www.agintai.com/ Summary Agint differs from workflow agents as a compiler agent (like IDE). It translates natural language into an…
Read breakdown
NetGent: Agent-Based Automation of Network Application Workflows

January 7, 2026

Paper

ref: https://arxiv.org/pdf/2509.00625 github: https://github.com/SNL-UCSB/netgent Summary The paper introduces a state machine logic, similar to how games operate, into the field of UI…
Read breakdown
Needle in the Web: A Benchmark for Retrieving Targeted Web Pages inthe Wild

January 5, 2026

Paper

ref: https://arxiv.org/pdf/2512.16553 github: https://github.com/Tango-Whiskyman/Needle_in_the_Web Summary Needle in the Web explores a new benchmark for evaluating LLM search agents. It uses a broadcast…
Read breakdown
ScreenAgent : A Vision Language Model-driven Computer Control Agent

December 29, 2025

Paper

ref: https://arxiv.org/pdf/2402.07945 github: https://github.com/niuzaisheng/ScreenAgent Summary Performed end2end LLM agent development by constructing a real desktop interaction environment through VNC, enabling the agent to…
Read breakdown