Research

Research at RunoxAI

We publish our work on neural retrieval, evaluation, and AI-native search. Our research directly powers the products we ship — and we believe in sharing it with the broader community.

Research areas

The problems we are focused on

Neural Retrieval

Advancing the state of the art in dense retrieval and re-ranking, with a focus on semantic understanding and real-world query diversity.

Structured Extraction

Extracting structured data (companies, people, events) from unstructured web content at scale, with high precision and recall.

Evaluation & Benchmarks

Building rigorous evaluation frameworks that measure what actually matters for AI applications, not just traditional IR metrics.

Latency & Scalability

Pushing the limits of what is possible at low latency and global scale, from index construction to serving billions of queries.

Publications

Selected papers from the RunoxAI research team

NeurIPS 2024December 2024

Neural Retrieval at Scale: Semantic Search for Large Language Models

Priya Mehta, Jordan Kim, Alex Rivera

We present a novel approach to web-scale neural retrieval that achieves state-of-the-art performance on standard benchmarks while maintaining sub-200ms latency. Our method combines dense retrieval with a proprietary re-ranking stage trained on billions of real-world queries.

RetrievalScalabilityLLMs
Read paper
EMNLP 2024October 2024

WebBench: A Comprehensive Evaluation Suite for Web Search Quality

Priya Mehta, Sarah Williams

Existing search evaluation frameworks focus on keyword-based retrieval. We introduce WebBench, a benchmark suite specifically designed to measure the quality of semantic and neural web search, covering diverse query types, freshness requirements, and domain-specific tasks.

EvaluationBenchmarksNLP
Read paper
ACL 2024August 2024

Highlight Extraction: Token-Efficient Retrieval for Agent Applications

Jordan Kim, Priya Mehta

We study the problem of extracting the most relevant text passages from web pages given a user query, with the explicit goal of minimizing token consumption in downstream LLM applications. Our approach reduces token usage by up to 75% while maintaining answer quality.

HighlightsToken EfficiencyAgents
Read paper
WWW 2024May 2024

LiveCrawl: Real-Time Web Content Acquisition for AI Applications

Sarah Williams, Marcus Chen

Stale cached data is a major limitation for AI applications that require up-to-date information. We describe LiveCrawl, a system that fetches and processes web pages in real time, enabling AI agents to access the most current information available on the web.

Web CrawlingReal-TimeInfrastructure
Read paper
SIGIR 2023July 2023

Company Search: Structured Entity Retrieval from Unstructured Web Data

Priya Mehta, Alex Rivera, David Park

We present a system for retrieving structured company information from the web, integrating entity recognition, cross-document inference, and confidence estimation. Our approach powers RunoxAI's company search vertical, which serves thousands of enterprise customers.

Entity RetrievalStructured DataEnterprise
Read paper

Join our research team

We are looking for researchers and engineers who want to work on foundational problems in information retrieval and AI. Come build with us.