Skip to main content
Search PubMed’s 37M+ biomedical and life sciences papers through Valyu’s unified API. Get full-text semantic search with author metadata, citations, and DOIs.

Dataset Overview

PropertyValue
Source IDvalyu/valyu-pubmed
Size37M+ papers
CoverageMedicine, genetics, pharmacology, epidemiology, life sciences
UpdatesMonthly
Data TypeUnstructured (full-text)

What You Get

  • Full-text search - Search across abstracts and paper content, not just titles
  • Author metadata - Author names, affiliations, and ORCID IDs
  • Citation data - DOIs, citation counts, and reference links
  • Publication dates - Filter by date ranges for time-sensitive research
  • Semantic ranking - Results ranked by relevance to your query

Quick Start

from valyu import Valyu

valyu = Valyu()

response = valyu.search(
    "CRISPR gene editing therapeutic applications",
    search_type="proprietary",
    included_sources=["valyu/valyu-pubmed"],
    max_num_results=10
)

for result in response.results:
    print(f"Title: {result.title}")
    print(f"Authors: {', '.join(result.authors) if result.authors else 'N/A'}")
    print(f"DOI: {result.doi or 'N/A'}")
    print(f"Content: {result.content[:300]}...")

Use Cases

  • Literature reviews - Comprehensive searches across biomedical literature
  • Drug discovery research - Find studies on compounds, targets, and mechanisms
  • Clinical evidence - Locate clinical studies and treatment outcomes
  • Systematic reviews - Gather sources for meta-analyses
  • Medical AI training - Build datasets for healthcare AI applications

Combine with Other Sources

PubMed works well combined with other healthcare datasets:
response = valyu.search(
    "CAR-T cell therapy clinical outcomes",
    search_type="proprietary",
    included_sources=[
        "valyu/valyu-pubmed",
        "valyu/valyu-clinical-trials",
        "valyu/valyu-biorxiv"
    ],
    max_num_results=20
)