Starting off as a muggle that naïve to the Math's and Data Science world.

Blog


  • Agint: Agentic Graph Compilation for Software Engineering Agents

    Agint: Agentic Graph Compilation for Software Engineering Agents

    ref: https://arxiv.org/pdf/2509.00625 webpage: https://www.agintai.com/ Summary Agint differs from workflow agents as a compiler agent (like IDE). It translates natural language into an executable, result-oriented DAG, where operating at the graph level enables parallel execution and removes the constraint of linear, chain-based generation. This approach emphasizes result-oriented execution: workflows are not fixed upfront but are dynamically…

  • NetGent: Agent-Based Automation of Network Application Workflows

    NetGent: Agent-Based Automation of Network Application Workflows

    ref: https://arxiv.org/pdf/2509.00625 github: https://github.com/SNL-UCSB/netgent Summary The paper introduces a state machine logic, similar to how games operate, into the field of UI automation. It compiles natural language into reusable, iterative states, where each state uses LLM-based reasoning for action selection and execution. This approach extends ReAct by adding explicit state memory and caching (compile-then-replay), reducing…

  • Needle in the Web: A Benchmark for Retrieving Targeted Web Pages inthe Wild

    ref: https://arxiv.org/pdf/2512.16553 github: https://github.com/Tango-Whiskyman/Needle_in_the_Web Summary Needle in the Web explores a new benchmark for evaluating LLM search agents. It uses a broadcast + parallel retrieval approach (fuzzy exploratory search) instead of traditional multi-hop reasoning. Retrieved webpages are verified to ensure all query criteria are satisfied using single source, selecting a “ground-truth” page for answer generation.…

  • ScreenAgent : A Vision Language Model-driven Computer Control Agent

    ref: https://arxiv.org/pdf/2402.07945 github: https://github.com/niuzaisheng/ScreenAgent Summary Performed end2end LLM agent development by constructing a real desktop interaction environment through VNC, enabling the agent to perceive screenshots and issue mouse and keyboard actions. A UI automation process was introduced, with actions formalized as function calls and organized into planning, action, and reflect loops. Within the acting and reflecting…

  • Finetuning LLMs for Automatic Form Interaction on Web-Browser in Selenium Testing Framework

    ref: https://arxiv.org/pdf/2511.15168 Summary Trained a new LLM to understand web form (HTML code) and generate reliable Selenium script for webpage testing. Differ to WebVoyager, which rely on visual navigation (UI Agent). This method model UI automation to a deterministic code generation working on UI/UX. While this improve execution, the approach still suffer typical code generation failure.…

  • The Iceberg Index: Measuring Skills-centered Exposure in the AI Economy

    ref: https://arxiv.org/pdf/2510.25137 Summary Performed a workforce “digital twin” simulation on human capabilities overlap with AI. Existing workforce metrics does not work on AI assisted task/skill. A newly metric is introduce to identify task/skill correlate to wages. While tech role (programmer, data science & program manager) already disputed, repetitive cognitive and administrative work remain largely invisible.

  • The courage to be disliked

    The courage to be disliked

    The book placed a mirror in front of me, I saw not who I was, but how I had been living. Etiology says “I caught a cold because I played in the rain, therefore I’ll never cure”, but Teleology say “I chose to play in the rain, so catching a cold is simply what followed.”…

  • Dirtiest Business Trick

    Dirtiest Business Trick

    How to start the right business — even if you have no idea what to do. Zhang had a shoplot but no business plan. He turned to his friend Lee for advice. Lee suggest him a trick:“Put up a banner offering the shoplot at below-market rent. When people inquire, ask about their business idea. Say…

  • Sell me this pen

    Sell me this pen

    Link: Instagram Round 1 Customer: Ok, i wanna try something, sell me this pen.Salesman: Sir, this is a marker. (Created a tailwind) Round 2 Customer: Okay, sell me that marker.Salesman: Do you need a marker? (Checking for a demand)Customer: Well, i actually don’t need a marker right now. Round 3 Salesman: Then why do you…

  • Predicting 100% in IRIS dataset

    Predicting 100% in IRIS dataset

    dataset: https://archive.ics.uci.edu/dataset/53/iris While scrolling through YouTube, I came across this video:https://www.youtube.com/watch?v=MdOCu2Gr-0g It explores Fibonacci numbers, which sparked a thought—could I experiment with them in a unique way, perhaps using the Iris dataset? First, let’s create a sequence of Fibonacci numbers. Result: Next, we create a function that returns the largest Fibonacci number closest to a…

  • GDG DevFest KL 2024

    When: Sat 2024-12-07 9am – 6pm (MYT)Where: Level 2, Menara Ken TTDI, 37, Jalan Burhanuddin Helmi, Taman Tun Dr Ismail, 60000 Kuala Lumpur, Wilayah Persekutuan Kuala Lumpur Malaysia

  • AI Tinkerers Kuala Lumpur #6

    When: Mon 2024-11-28 6pm – 9pm (MYT)Where: Level 8, Menara Southpoint @ Mid Valley City, Medan Syed Putra Selatan, 59200 Kuala Lumpur , Malaysia Continue to learn the way of communication WRONG EXAMPLE Kim: Hi, I’m Kim Chye from Western Digital. [I did not ask how are you, or saying great speech]B: Hey, so what…

  • Quiz

    Quiz

    Saw a challenge in LinkedIn about creating a tree and here is my try =P resolution from the internet.

  • Multilevel Data Analysis

    Day 94 Multilevel AnalysisContextual AnalysisMixed Effects Models Day 95 Tutorial Day 96 Data collected wronglyType of mistake occur Day 97 Variable Proposition Day 98, Day 98 (2) TutorialStatistical Treatment of Clustered DataMethod– Aggregation– Disaggregation Day 99, Day 99 (2), 101 (2), 103 DisaggregationHow to avoid? Using two-stage sample.Looking for correlationCorrelation Analysis (Intraclass Correlation-ICC)F-testLinear RegressionTotal Regression, Regression…

  • Day 106

    Question 1 As a variable at the student level that is essential for explaining Economics score, we use the measure for revision hours per month taken from a study. The revision hours has been centered, so that its mean is 0. The results are presented below. Revision hours here is the variable with overall centering…