Research breakdowns, practical implementation notes, and opinionated takes from real-world data and AI work.
-
Coverage – Aware Web Crawling for Domain – Specific Supplier Discovery via a Web – Knowledge – Web Pipeline
ref: https://arxiv.org/pdf/2602.24262 Summary Most knowledge graphs fail in production because they are incomplete and hard to maintain. This paper flips that weakness…
-
How to Train Your Deep Research Agent? Prompt, Reward, and Policy Optimization in Search-R1
ref: https://arxiv.org/pdf/2602.19526v1 Summary I used to think longer prompts and step-by-step reasoning always make AI smarter. In daily use, that often helps.…
-
Prompt Repetition Improves Non-Reasoning LLMs
ref: https://arxiv.org/pdf/2512.14982 Summary The easiest AI upgrade might be just “copy and paste”. Google researchers tested with 7 leading models and found…
-
Verbalized Sampling: How to mitigate Mode Collapse and unlock LLM diversity
ref: https://arxiv.org/pdf/2510.01171 Summary AI models repeat safe answers, but a simple prompt tweak can unlock far better thinking. Researchers introduced “verbalized sampling,”…
-
Tool-MAD: A Multi-Agent Debate Framework for Fact Verification with Diverse Tool Augmentation and Adaptive Retrieval
ref: https://arxiv.org/pdf/2601.04742 Summary The team studied several methods to improve LLM accuracy and combined the strongest ideas to build Tool-MAD. The system…
-
The Next Conversation
Understanding is a superpower. Liking is optional. Jefferson Fisher teaches a simple but powerful idea: Argue less. Talk more The book changed…
-
Small or Large? Zero-Shot or Finetuned? Guiding Language Model Choice for Specialized Applications in Healthcare
ref: https://arxiv.org/pdf/2504.21191 Summary The paper shows that finetuning still matters more than model size for well-defined tasks. The authors compared focused models…
-
SLMFix: Leveraging Small Language Models for error fixing with Reinforcement Learning
ref: https://arxiv.org/pdf/2511.19422 Summary The paper suggest to train small language model (SLM) repair code for least known programming languages. The report over…
-
Agint: Agentic Graph Compilation for Software Engineering Agents
ref: https://arxiv.org/pdf/2509.00625 webpage: https://www.agintai.com/ Summary Agint differs from workflow agents as a compiler agent (like IDE). It translates natural language into an…
-
NetGent: Agent-Based Automation of Network Application Workflows
ref: https://arxiv.org/pdf/2509.00625 github: https://github.com/SNL-UCSB/netgent Summary The paper introduces a state machine logic, similar to how games operate, into the field of UI…
-
Needle in the Web: A Benchmark for Retrieving Targeted Web Pages inthe Wild
ref: https://arxiv.org/pdf/2512.16553 github: https://github.com/Tango-Whiskyman/Needle_in_the_Web Summary Needle in the Web explores a new benchmark for evaluating LLM search agents. It uses a broadcast…
-
ScreenAgent : A Vision Language Model-driven Computer Control Agent
ref: https://arxiv.org/pdf/2402.07945 github: https://github.com/niuzaisheng/ScreenAgent Summary Performed end2end LLM agent development by constructing a real desktop interaction environment through VNC, enabling the agent to…







