What Data We Serve
Every record carries five fields: description, keywords, metadata, page_content, and data. Gold labels enable fast triage, so agents can filter thousands of records without reading full content.
SEC Filings
1,000 stocks · 10 years
- 10-K, 10-Q, 8-K filings
- Earnings transcripts
- Company presentations & slides
- Page-level keyword search
Endpoints: search_documents · get_document_pages · get_document_outline
Clinical Trials
300+ biotech · 48,291 studies
- ClinicalTrials.gov full data
- Gold labels: signal_tier, signal_score
- Phase tracking (1→2→3→approved)
- Drug name + condition search
Endpoints: search_clinical_trials · get_clinical_trial
FDA Approvals
12,847 applications
- NDA, BLA, ANDA approvals
- Priority review & breakthrough
- Orphan designation tracking
- Gold labels: approval_tier, regulatory_signal
Endpoints: search_fda_approvals · get_fda_approval
FAERS Adverse Events
892,441 reports
- Drug-reaction pairs
- Seriousness classification
- Patient demographics
- Reporter type filtering
Endpoints: search_faers_events · get_faers_event
EIA Petroleum
Weekly inventory + spot prices
- Cushing crude, PADD districts
- WTI, Brent, RBOB prices
- Supply signal: bullish/bearish
- Weekly change tracking
Endpoints: search_eia_inventory · search_eia_prices
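A minimal sketch of how weekly change tracking could feed a bullish/bearish supply signal. The mapping used here (an inventory draw reads as bullish, a build as bearish) is a common market convention and an assumption, not the service's documented rule. The function names, the threshold parameter, and the "neutral" label are hypothetical.

```python
from typing import List


def weekly_changes(levels: List[float]) -> List[float]:
    """Week-over-week change for a series of inventory levels (oldest first)."""
    return [curr - prev for prev, curr in zip(levels, levels[1:])]


def supply_signal(change: float, threshold: float = 0.0) -> str:
    """Label one weekly change.

    Assumption: a draw (negative change) is bullish for prices and a
    build (positive change) is bearish. "neutral" is a hypothetical
    label for changes within the threshold.
    """
    if change < -threshold:
        return "bullish"
    if change > threshold:
        return "bearish"
    return "neutral"


# Illustrative inventory levels only, not real EIA data.
levels = [430.0, 427.5, 431.0]
signals = [supply_signal(c) for c in weekly_changes(levels)]
```

A nonzero threshold would keep small weekly moves from flipping the label.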
Vessel Tracking
2,847 tankers live
- AIS position data
- Crude/product/LNG tankers
- Port visit history
- Bounding box spatial queries
Endpoints: search_vessel_positions · search_ports
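For readers unfamiliar with bounding box queries, the sketch below shows the underlying test on latitude and longitude. It runs locally on sample points and does not call search_vessel_positions. The Position type, the parameter names, and the vessel ids are hypothetical.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Position:
    vessel_id: str  # hypothetical identifier
    lat: float      # degrees, -90 to 90
    lon: float      # degrees, -180 to 180


def in_bounding_box(p: Position, min_lat: float, min_lon: float,
                    max_lat: float, max_lon: float) -> bool:
    """True if the position lies inside the box (edges included)."""
    if not (min_lat <= p.lat <= max_lat):
        return False
    if min_lon <= max_lon:
        return min_lon <= p.lon <= max_lon
    # min_lon > max_lon means the box crosses the antimeridian (180 degrees).
    return p.lon >= min_lon or p.lon <= max_lon


def filter_positions(positions: List[Position], min_lat: float, min_lon: float,
                     max_lat: float, max_lon: float) -> List[Position]:
    return [p for p in positions
            if in_bounding_box(p, min_lat, min_lon, max_lat, max_lon)]


# Illustrative points only, not live AIS data.
sample = [
    Position("tanker-a", 29.0, -94.5),
    Position("tanker-b", 51.9, 4.1),
    Position("tanker-c", -17.0, 179.5),
]
gulf = filter_positions(sample, 25.0, -98.0, 31.0, -88.0)
```

The antimeridian branch matters for Pacific routes, where a box can run from 170 to -170 degrees of longitude.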
Weather Signals
200 locations · 10 years
- ERA5 historical baselines
- Temperature anomaly detection
- Energy demand signals
- Heating/cooling degree days
Endpoints: get_weather_baseline
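Heating and cooling degree days and temperature anomalies follow simple arithmetic, sketched below. The 65 °F base temperature is a widely used US convention and an assumption here, since the page does not state which base the service uses. The function names are hypothetical and nothing here calls get_weather_baseline.

```python
from typing import Tuple


def degree_days(t_min: float, t_max: float, base: float = 65.0) -> Tuple[float, float]:
    """Return (heating, cooling) degree days for one day.

    Uses the daily mean of min and max temperature, in the same units as base.
    Assumption: base of 65 degrees Fahrenheit.
    """
    mean = (t_min + t_max) / 2.0
    hdd = max(0.0, base - mean)  # colder than base: heating demand
    cdd = max(0.0, mean - base)  # warmer than base: cooling demand
    return hdd, cdd


def temperature_anomaly(observed: float, baseline: float) -> float:
    """Observed temperature minus the historical baseline for that date."""
    return observed - baseline


cold_day = degree_days(30.0, 50.0)
hot_day = degree_days(70.0, 90.0)
```

Summing daily values over a week or month gives the totals usually quoted as energy demand indicators.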
Agentic Retrieval
Unified across all sources
- 4-step pattern: list → outline → search → read
- 99%+ retrieval accuracy
- Works for PDFs, trials, filings
- Row-level page_content access
Endpoints: list_sources · read_source_outline · search_keyword_in_source · read_source_pages
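The 4-step pattern can be sketched as a loop over the four endpoints. The functions below reuse the endpoint names, but their signatures, return shapes, and the in-memory data are hypothetical stand-ins so the example runs without network access. The real request and response formats are not specified on this page.

```python
from typing import Dict, List

# Hypothetical in-memory stand-in for the service.
FAKE_SOURCES = {
    "acme-10k-2023": {
        "outline": ["Business", "Risk Factors", "Financial Statements"],
        "pages": [
            "Acme designs and sells widgets.",
            "Risk factors: supply chain disruption may affect margins.",
            "Revenue rose 12% year over year.",
        ],
    }
}


def list_sources() -> List[str]:
    return sorted(FAKE_SOURCES)


def read_source_outline(source_id: str) -> List[str]:
    return FAKE_SOURCES[source_id]["outline"]


def search_keyword_in_source(source_id: str, keyword: str) -> List[int]:
    """Return 0-based page indices whose text contains the keyword."""
    pages = FAKE_SOURCES[source_id]["pages"]
    return [i for i, text in enumerate(pages) if keyword.lower() in text.lower()]


def read_source_pages(source_id: str, page_indices: List[int]) -> List[str]:
    pages = FAKE_SOURCES[source_id]["pages"]
    return [pages[i] for i in page_indices]


def retrieve(keyword: str) -> Dict[str, List[str]]:
    """Run list -> outline -> search -> read and collect matching pages."""
    results: Dict[str, List[str]] = {}
    for source_id in list_sources():                          # 1. list
        outline = read_source_outline(source_id)              # 2. outline
        if not outline:
            continue  # nothing to navigate in this source
        hits = search_keyword_in_source(source_id, keyword)   # 3. search
        if hits:
            results[source_id] = read_source_pages(source_id, hits)  # 4. read
    return results
```

Only the final step returns full page text, so an agent following this order spends tokens on page_content only for pages that already matched.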
Agent-Use-Ready Format
Every record returns five fields optimized for LLM consumption. Agents triage with description + keywords, then fetch page_content only when needed.
description
One-line summary for fast triage. Agents scan 100 descriptions in ~2K tokens.
keywords
Up to 10 high-signal terms. Enables keyword matching without reading content.
metadata
Flat key-value pairs: ticker, date, phase, nct_id. Structured for filtering.
page_content
Full markdown rendering. The text agents feed to LLMs for deep reasoning.
data
Complete silver record with all columns. For programmatic access.
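A minimal sketch of the five-field record and the triage step described above: filter on metadata and keywords first, and read page_content only for the records that survive. The Record class, the triage function, and all sample values (including the placeholder ticker and nct_id) are hypothetical. The field names come from this page.

```python
from dataclasses import dataclass, field
from typing import Any, Dict, List


@dataclass
class Record:
    description: str             # one-line summary
    keywords: List[str]          # up to 10 terms
    metadata: Dict[str, Any]     # flat key-value pairs
    page_content: str            # full markdown text
    data: Dict[str, Any] = field(default_factory=dict)  # complete record


def triage(records: List[Record], wanted_keywords: List[str],
           **metadata_filters: Any) -> List[Record]:
    """Keep records that match every metadata filter and share a keyword.

    Matching is case-insensitive on keywords and exact on metadata values.
    page_content is never read here.
    """
    wanted = {k.lower() for k in wanted_keywords}
    selected = []
    for r in records:
        if any(r.metadata.get(k) != v for k, v in metadata_filters.items()):
            continue
        if wanted & {k.lower() for k in r.keywords}:
            selected.append(r)
    return selected


# Placeholder records for illustration only.
sample = [
    Record("Phase 2 oncology trial (placeholder)", ["oncology", "phase 2"],
           {"phase": "2", "nct_id": "NCT00000000"}, "# Trial text ..."),
    Record("Annual report (placeholder)", ["10-K", "revenue"],
           {"ticker": "EXMP", "date": "2024-01-31"}, "# Filing text ..."),
]
shortlist = triage(sample, ["revenue"], ticker="EXMP")
```

After triage, an agent would pass only the page_content of the shortlisted records to the LLM.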
Start Querying
48 endpoints. 10M+ pages. 5,000 RPS. Free tier: 1,000 calls/month.