RLee
About
Ryan Lee
Categories
All
(3)
attention
(1)
benchmarks
(1)
extraction
(2)
llm
(3)
web-extraction
(2)
⚠️ I am in the progress of migrating my website. Thank you for your patience!
Mistral 7B
llm
attention
Improving model performance often requires training bigger models. However, bigger models result in higher computational cost and inference latency, which could prohibit…
Aug 10, 2025
Ryan Lee
NEXT-EVAL: Next Evaluation of Traditional and LLM Web Data Record Extraction
extraction
web-extraction
llm
benchmarks
Web data record extraction is a problem of extracting repeated sets of semantically related elements from web pages. Effective evaluation of web data record extraction is…
Jun 20, 2025
Ryan Lee
XPath Agent: Automating Web Scraping with LLM‑Built XPaths
extraction
web-extraction
llm
Writing durable XPath for web scraping is time-consuming and brittle. A good XPath must work across multiple page variants, not just one.
XPath Agent
(Yu Li, Bryce Wang…
Jun 19, 2025
Ryan Lee
No matching items