Collections let you save groups of sources and reference them by name in API calls. Instead of listing multiple sources every time, create a collection once and reuse it across all your searches.
Collections are in beta. Create and manage them at platform.valyu.ai/user/collections.

What You Can Do

  • Save source combinations - Bundle frequently-used domains and datasets
  • Simplify API calls - Reference multiple sources with a single name
  • Share with your team - All org members can use your collections
  • Generate with AI - Let AI suggest sources based on your research needs

Creating a Collection

  1. Go to Collections (platform.valyu.ai/user/collections)
  2. Click Create Collection
  3. Add sources (Valyu datasets, domains, or URLs)
  4. Save with a memorable name
You can also use Generate with AI to automatically suggest relevant sources based on a description of your research needs.

Using Collections

Reference collections in your API calls using the collection: prefix in included_sources:
from valyu import Valyu

valyu = Valyu()
response = valyu.search(
    query="latest quarterly earnings reports",
    included_sources=["collection:my-finance-sources"]
)

Combining Collections and Sources

Mix collections with individual sources in the same request:
from valyu import Valyu

valyu = Valyu()
response = valyu.search(
    query="biotech company clinical trial results",
    included_sources=[
        "collection:medical-research",  # Your saved collection
        "techcrunch.com",                # Additional domain
        "valyu/valyu-patents"            # Additional dataset
    ]
)
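If the same mix of collections and sources is assembled in several places, a small helper can keep the prefixing consistent. The sketch below is only an illustration: build_included_sources is a hypothetical function, not part of the Valyu SDK, and it does nothing more than build the list of strings you would pass as included_sources, using the collection: prefix described above.

```python
from typing import Iterable, List

# Prefix used to reference a saved collection in included_sources.
COLLECTION_PREFIX = "collection:"


def build_included_sources(
    collections: Iterable[str] = (),
    sources: Iterable[str] = (),
) -> List[str]:
    """Hypothetical helper (not part of the Valyu SDK).

    Prefixes collection names with "collection:" when the prefix is missing,
    appends plain sources unchanged, and drops exact duplicates while
    preserving order.
    """
    result: List[str] = []
    seen = set()

    for name in collections:
        ref = name if name.startswith(COLLECTION_PREFIX) else COLLECTION_PREFIX + name
        if ref not in seen:
            seen.add(ref)
            result.append(ref)

    for source in sources:
        if source not in seen:
            seen.add(source)
            result.append(source)

    return result


included = build_included_sources(
    collections=["medical-research"],
    sources=["techcrunch.com", "valyu/valyu-patents"],
)
```

The resulting list can then be passed as included_sources in the search call shown above.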

Example Collections

Here are some collection ideas to get you started:

Financial Research Collection

Sources to include:
  • valyu/valyu-sec-filings - SEC regulatory filings
  • valyu/valyu-stocks - Stock market data
  • valyu/valyu-earnings-US - Earnings reports
  • reuters.com - Financial news
  • bloomberg.com - Market analysis

Medical Research Collection

Sources to include:
  • valyu/valyu-pubmed - Medical literature
  • valyu/valyu-clinical-trials - Clinical trial data
  • valyu/valyu-drug-labels - FDA drug information
  • nejm.org - New England Journal of Medicine
  • thelancet.com - The Lancet

Tech Documentation Collection

Sources to include:
  • docs.aws.amazon.com - AWS documentation
  • cloud.google.com/docs - Google Cloud docs
  • learn.microsoft.com - Microsoft docs
  • kubernetes.io/docs - Kubernetes docs
  • developer.mozilla.org - MDN Web Docs

AI-Generated Collections

Use the Generate with AI feature on the platform to get source suggestions automatically. Describe your research needs in natural language, and the AI recommends relevant Valyu datasets and domains. For example, given a description such as:
“I need sources for researching biotech companies, including SEC filings, medical research papers, and news from reliable financial sources.”
the AI suggests appropriate sources such as valyu/valyu-sec-filings, valyu/valyu-pubmed, valyu/valyu-clinical-trials, and trusted news domains.

Visibility & Permissions

  • All organization members can view and use any collection in their org
  • Only the creator can edit or delete their collection
  • Collections are scoped to your organization - they’re not visible to other orgs

Error Handling

If a collection doesn’t exist or isn’t accessible, the request still succeeds and the response includes a warning:
{
  "success": true,
  "warnings": ["Collection 'unknown-collection' not found"],
  "results": [...]
}
The search continues with any other valid sources you provided. If only an invalid collection was specified, you’ll get empty results with the warning.
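Because a missing collection produces a warning rather than an error, it can be worth checking for warnings explicitly. The following is a minimal sketch, assuming the response has already been converted to a plain dict with the warnings and results keys shown above; how you obtain that dict from the SDK response object is not covered here. The missing_collections function is hypothetical, and the message format it parses is inferred from the single example above, so it may not match every warning the API emits.

```python
import re
from typing import Any, Dict, List

# Assumed message format, taken from the example warning above.
_NOT_FOUND = re.compile(r"^Collection '(?P<name>.+)' not found$")


def missing_collections(response: Dict[str, Any]) -> List[str]:
    """Hypothetical helper: return collection names reported as not found."""
    names: List[str] = []
    for warning in response.get("warnings") or []:
        match = _NOT_FOUND.match(warning)
        if match:
            names.append(match.group("name"))
    return names


# Illustrative payload shaped like the example above, with empty results
# (the case where only an invalid collection was specified).
example_response = {
    "success": True,
    "warnings": ["Collection 'unknown-collection' not found"],
    "results": [],
}

missing = missing_collections(example_response)
if missing and not example_response["results"]:
    print(f"No results; unknown collections: {missing}")
```

Checking both the warnings and the results distinguishes a search that genuinely found nothing from one that ran against a mistyped or inaccessible collection.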

Best Practices

Use Descriptive Names

Give collections clear, descriptive names such as medical-research, finance-sec-filings, or tech-docs

Keep Collections Focused

Create multiple small collections rather than one large catch-all

Document Your Collections

Add descriptions so team members understand what each collection is for

Review Periodically

Update collections as new datasets become available or your needs change