Pre-trained large language models have advanced the state of the art on many NLP benchmarks and have been shown to be strong few-shot learners. Since offline RL can be framed as a sequence modeling problem, it stands to reason that offline RL could benefit from such pre-training. The authors of this paper show that pre-trained language models can indeed help models learn offline RL tasks.

Like Decision Transformer, the goal is to autoregressively model trajectories by representing them as a sequence of states, actions, and rewards. Instead of initializing the transformer randomly, the authors experiment with two pre-trained models: the GPT2-small model and “ChibiT”, a model of roughly the same size as Decision Transformer, pre-trained on Wikipedia articles. In addition, to align the state, action, and reward embeddings with the language embeddings, the authors add an auxiliary loss term that encourages these input embeddings to be similar to the language token embeddings (see the sketch below).
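Below is a minimal PyTorch sketch (not the authors' code) of this setup: a Decision Transformer-style model whose backbone is initialized from pre-trained GPT-2 weights, plus an illustrative embedding-alignment term based on cosine similarity. The class name `PretrainedDT`, the `alignment_loss` helper, the anchor-sampling scheme, and the loss weighting are hypothetical choices made here for illustration; only the overall idea (LM-initialized backbone, trajectory tokens, similarity-based auxiliary loss) follows the description above.

```python
# Illustrative sketch only: a Decision Transformer-style model that reuses a
# pre-trained GPT-2 backbone and adds a hypothetical embedding-alignment loss.
import torch
import torch.nn as nn
import torch.nn.functional as F
from transformers import GPT2Model


class PretrainedDT(nn.Module):
    def __init__(self, state_dim, act_dim, hidden=768):
        super().__init__()
        self.backbone = GPT2Model.from_pretrained("gpt2")  # pre-trained LM weights
        self.embed_return = nn.Linear(1, hidden)
        self.embed_state = nn.Linear(state_dim, hidden)
        self.embed_action = nn.Linear(act_dim, hidden)
        self.predict_action = nn.Linear(hidden, act_dim)
        # frozen copy of the LM's token embeddings, used only for the alignment loss
        self.lm_embeddings = self.backbone.get_input_embeddings().weight.detach()

    def forward(self, returns, states, actions):
        # returns: (B, T, 1), states: (B, T, state_dim), actions: (B, T, act_dim)
        r = self.embed_return(returns)
        s = self.embed_state(states)
        a = self.embed_action(actions)
        # interleave as (R_1, s_1, a_1, R_2, s_2, a_2, ...)
        B, T, H = s.shape
        tokens = torch.stack((r, s, a), dim=2).reshape(B, 3 * T, H)
        hidden = self.backbone(inputs_embeds=tokens).last_hidden_state
        # predict each action from the hidden state at the preceding state token
        action_preds = self.predict_action(hidden[:, 1::3])
        return action_preds, tokens

    def alignment_loss(self, tokens, n_anchors=1024):
        # Hypothetical auxiliary term: push trajectory embeddings toward a random
        # subset of language token embeddings (maximize the best cosine similarity).
        idx = torch.randint(0, self.lm_embeddings.size(0), (n_anchors,))
        anchors = self.lm_embeddings[idx].to(tokens.device)  # (n_anchors, H)
        sims = F.cosine_similarity(
            tokens.unsqueeze(2),                      # (B, 3T, 1, H)
            anchors.view(1, 1, n_anchors, -1),        # (1, 1, n_anchors, H)
            dim=-1,
        )                                             # (B, 3T, n_anchors)
        return -sims.max(dim=-1).values.mean()
```

A training step would then combine the usual action-prediction loss with this auxiliary term, e.g. `loss = F.mse_loss(action_preds, actions) + lam * model.alignment_loss(tokens)`, where `lam` is an assumed weighting hyperparameter.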

On most tasks from the DQN-replay Atari dataset and the D4RL benchmark, models initialized with ChibiT or GPT2 weights achieve performance similar to or better than Decision Transformer and other offline RL algorithms.

Learn More

The authors also experimented with vision-pre-trained models (CLIP, ImageGPT) and performed various analyses and ablation studies. Check the paper to learn more!

Relevant Resources