How Our Data Works
Transparency is core to NutriBalance. When you make dietary decisions based on our data, you deserve to know exactly where it comes from, how confident we are, and what review process it went through.
The Data Pipeline
Source
Data is imported from government databases, scientific papers, or researched by AI.
Observe
Each value becomes a substance observation — tagged with its source, derivation method, and initial confidence score.
Evidence
Observations link to evidence items: page references, row locators, URLs, and text snippets from original sources.
Review
AI-generated and low-confidence observations go through human review. Reviewers can approve, reject, or request revision.
Resolve
Approved observations become resolved values — the trusted numbers you see on food pages, with a final confidence score and source summary.
Where Data Comes From
Government Databases
Highest trustPrimary data from USDA FoodData Central, national food composition databases, and peer-reviewed government nutrition references. These are the gold standard for nutritional values.
Trust level: 90–100
Scientific Literature
High trustPublished research papers, clinical studies, and meta-analyses on substance retention, bioavailability, and food composition. Used especially for cooking-method adjustments.
Trust level: 80–95
Industry Data
Moderate trustManufacturer-provided nutritional labels and industry food composition databases. Cross-referenced with other sources where possible.
Trust level: 60–80
AI-Researched
Review requiredWhen data is missing from primary sources, our AI agent researches scientific and government databases to fill gaps. All AI-extracted values are flagged for human review before being fully trusted.
Trust level: 40–85
Confidence Scores
Every substance value has a confidence score from 0 to 100. This tells you how reliable the data point is. On food pages, you see the average confidence across all tracked substances.
Directly from government databases or verified analytical data.
Strong scientific evidence with minor estimation involved.
Reasonable estimates from related foods, AI research, or calculated from components.
Limited data available. Values are rough estimates and should not be used for critical medical decisions.
How Values Are Derived
Not all nutritional values come from the same method. Each substance observation is tagged with its derivation type so you know how the number was obtained.
Measured in a laboratory from actual food samples.
Computed from known ingredient compositions using standardized formulas.
Inferred from similar foods or related entries in reference databases.
Filled from a closely related food when no direct data exists.
Researched by our AI agent from scientific sources. Always marked for human review.
The Review Process
Every substance value goes through a review cycle before becoming a trusted resolved value:
Your Feedback Matters
On every food page, you can flag inaccurate data or suggest corrections. Each submission links to the specific substance and variant, and our reviewers investigate every report. If you have a source URL or research to back up your correction, even better.
Questions about our data? Found something inaccurate?
Create an account and report it