Methodology

How The Clinical Index Works

Full transparency into our evidence verification process, scoring methodology, and quality assurance pipeline.

Our Philosophy

We believe in radical transparency

The supplement industry is plagued by opaque proprietary blends and unsubstantiated marketing claims. We take the opposite approach: our scoring formula, weights, and methodology are entirely public. If you can reproduce our inputs, you can reproduce our outputs.

PMID-Verified Citations Only

Every study referenced in a TCI dossier is indexed in PubMed and identified by its PMID. We never generate synthetic citations. Each PMID is validated against the NCBI database before inclusion.

Deterministic Scoring

All numeric scores are computed by deterministic Python code with fixed weights. LLM agents produce qualitative narrative only. The same input always produces the same score.

Full-Text Evidence Extraction

We go beyond abstracts. Our pipeline retrieves full-text articles from PubMed Central and Unpaywall, extracting specific passages that support or contradict each ingredient claim.

The Pipeline

11 Specialized AI Agents, One Sequential Pipeline

Every product analysis passes through eleven purpose-built agents in strict sequence. Each agent has a single responsibility, uses a specific AI model, and produces structured output for the next stage.

01

IntakeAgent

Haiku 4.5

Parses supplement facts panels and label images. Extracts every ingredient, its dose, unit, and form (e.g., magnesium glycinate vs. magnesium oxide). Normalizes naming conventions across brands.

02

IngredientClassifier

Deterministic

Categorizes each ingredient by functional role: PRIMARY (active therapeutic), SUPPORTING (cofactors, bioavailability enhancers), or OTHER (excipients, fillers). This agent uses rule-based logic with zero LLM calls for complete reproducibility.

03

EvidenceResearchAgent

Haiku 4.5

Searches PubMed for 15-30 clinical studies per ingredient. Retrieves full-text articles via a PMC/Unpaywall cascade, prioritizing randomized controlled trials and systematic reviews. Extracts dosage data, population info, and outcome measures.

04

FormulationAnalysisAgent

Haiku 4.5

Evaluates whether each ingredient is dosed at clinically meaningful levels by comparing label doses against therapeutic ranges established in peer-reviewed literature. Flags under-dosed and over-dosed ingredients.

05

SafetyAssessmentAgent

Haiku 4.5

Checks each ingredient against Tolerable Upper Intake Levels (ULs) established by the National Academies. Identifies known drug interactions, contraindications, and population-specific warnings (pregnancy, pediatrics, elderly).

06

ClaimsValidationAgent

Haiku 4.5

Cross-references marketing claims made on the product label against the evidence base assembled by the EvidenceResearchAgent. Determines whether each claim is Supported, Partially Supported, or Unsupported by the literature.

07

DustingDetector

Haiku 4.5

Identifies "fairy-dusted" ingredients included at sub-clinical doses, typically less than 5% of the minimum effective dose found in clinical literature. Flags ingredients that appear on the label for marketing value rather than therapeutic benefit.

08

FTCComplianceAgent

Haiku 4.5

Scans label claims, product names, and marketing language for FTC and FDA regulatory violations. Detects disease claims, unsubstantiated "clinically proven" language, and structure/function claims missing required disclaimers.

09

COAVerificationAgent

Haiku 4.5

When a Certificate of Analysis is provided, validates that tested ingredient quantities match label claims within acceptable variance. Checks for heavy metals, microbial contamination, and identity verification against COA data.

10

SynthesisAgent

Sonnet 4

The only agent running on Claude Sonnet 4, our most capable model. Synthesizes outputs from all preceding agents into a coherent executive summary narrative. Produces the final dossier text while maintaining scientific accuracy.

11

QAReviewAgent

Haiku 4.5

Final quality assurance pass. Validates that all citations are real, scores are consistent with evidence, and the narrative accurately reflects the data. Catches edge cases and ensures fairness across product categories.

Scoring

Scientific Credibility Score (SCS)

Every product receives a composite score from 0 to 100, computed entirely by deterministic Python code. LLMs contribute narrative analysis but never influence the numeric score.

Without Certificate of Analysis

The standard scoring model applied to all products analyzed without third-party lab testing data.

35%

Evidence Quality

Strength, relevance, and volume of PubMed studies supporting each ingredient.

30%

Dose Adequacy

Whether ingredients are dosed within clinically effective ranges from published trials.

20%

Bioavailability

Quality of ingredient forms (e.g., chelated minerals vs. oxides, methylated B vitamins).

15%

Safety

Absence of Tolerable Upper Limit violations, contraindications, and interaction risks.

With Certificate of Analysis

When a brand provides third-party lab test results, the scoring model adds a Label Accuracy dimension and rebalances weights.

30%

Evidence Quality

Strength, relevance, and volume of PubMed studies.

25%

Dose Adequacy

Clinical dose alignment from peer-reviewed literature.

20%

Label Accuracy

How closely tested quantities match label claims (COA verification).

15%

Safety

UL compliance, drug interactions, contraindications.

10%

Bioavailability

Ingredient form quality and absorption characteristics.

Grade Scale

A+
95 - 100
A
90 - 94
A-
85 - 89
B+
80 - 84
B
75 - 79
B-
70 - 74
C+
65 - 69
C
60 - 64
C-
55 - 59
D
40 - 54
F
< 40

Verification Badge

9 Gates to Badge Eligibility

A product must pass every single gate to earn the TCI Verification Badge. There are no exceptions and no overrides. Failing even one gate disqualifies a product from badge eligibility.

1

Scientific Credibility Score of 75 or above

2

Each scoring dimension individually scores 40 or above

3

Safety assessment passed with no critical flags

4

QA review passed without major issues

5

Overall analysis quality is 70% or higher

6

FTC/FDA compliance score is 70 or higher

7

At least one evidence-based primary ingredient present

8

Confidence level is above LOW threshold

9

No critical quality issues detected in final review

Evidence Standards

How We Handle Evidence

Our evidence pipeline is designed to maximize accuracy and minimize the risk of AI-generated hallucinations.

PubMed-Indexed Studies Only

We exclusively source studies from PubMed, the gold standard for biomedical literature. Pre-prints, blog posts, manufacturer-funded white papers, and non-indexed journals are excluded from the evidence base.

Full-Text Extraction

When available through PubMed Central or Unpaywall, we retrieve and analyze the complete article text rather than relying solely on abstracts. This provides access to methodology details, dosage protocols, and result nuances that abstracts omit.

Hallucination Verification

Every citation generated by our AI agents is verified against the NCBI database. PMIDs are checked for existence, and extracted claims are fuzzy-matched against source text with a similarity percentage. Citations that cannot be verified are automatically removed.

Study Quality Weighting

Not all studies carry equal weight. Randomized controlled trials (RCTs) and systematic reviews receive the highest weight, followed by observational studies. In vitro and animal studies are noted but weighted significantly lower for human health claims.

Our Commitment

Transparency Is Non-Negotiable

Weights Shown in Every Dossier

Every dossier displays the exact scoring weights used. You can see precisely how each dimension contributed to the final score.

Reproducible Results

Same product input produces the same score output. Our deterministic scoring engine eliminates LLM randomness from numeric results.

No Pay-to-Play

You cannot buy a higher score. The scoring algorithm treats every product identically regardless of which brand submitted it.

Open Methodology

This page is our commitment. Every weight, every gate, every rule that affects your score is documented publicly.

Ready to know what the evidence actually says about your product?

Free to start. No credit card required.