The Anatomy of a Safety Claim
"X herb is dangerous" is not a safety determination — it is an assertion about evidence. Before evaluating the claim, you need to identify what kind of evidence it rests on, whether causality was assessed, and which biases may have inflated the apparent signal. These questions are not skepticism for its own sake; they are the minimum clinical epistemic standard.
What type of evidence is the claim citing?
A case report documents an adverse event in one patient. A cluster is several reports pointing toward a pattern. Case reports are signals, not proofs. They establish temporal association at best — the patient used the herb and then something happened. They cannot establish:
- Causation (the herb caused the event)
- Incidence (how often this happens per user)
- Risk compared to baseline (did exposure actually increase rate?)
Adverse event databases aggregate voluntary reports. A high count means the product has high use and/or high media attention around it — both inflate reporting independent of actual risk. There is no denominator (total users), no causality assessment, and systematic underreporting of routine events.
Animal hepatotoxicity studies use gavage (bolus delivery to a fasted animal) at doses that produce suprapharmacological human equivalents by body weight. The route, dose rate, fed/fasted state, and species-specific hepatic metabolism all interact. The preclinical failure rate for pharmaceutical candidates exceeds 90% — "no toxicity in rats" does not equal "safe in humans," and the inverse fails too.
Animal data is not useless. It generates coarse-grained signals worth investigating, and mechanistic data can be illuminating even when direct translation fails. The error is treating it as dispositive rather than preliminary.
Cohort studies track exposed vs. unexposed populations over time. They can establish incidence and relative risk, but confounding is still a major challenge in herbal research — users of botanical products often differ from non-users in health behaviors, polypharmacy, and baseline health status. Population selection can inflate the apparent signal for a subgroup whose risk actually belongs to their underlying condition or concomitant medications.
Causality Assessment Methods
Causality assessment asks: given this adverse event, how likely is it that this substance was responsible? Three method classes exist — each with different data requirements and tradeoffs. Note: the Naranjo scale is not a "method" — it is one algorithmic implementation, designed for pharmaceuticals and generally inapplicable to botanicals without modification. The class matters more than any specific instrument.
A trained clinician or panel evaluates all available evidence and reaches a narrative probability judgment. No formal scoring system.
Uses prior probability data and case details to compute a likelihood score. Produces a number with an inherent measure of uncertainty.
Structured decision tree with weighted yes/no criteria: temporal relationship, dechallenge, rechallenge, alternative explanation, overdose, prior reports.
Data Hierarchy for Causality Assessment
Not all data elements carry equal weight. Before choosing a method, classify what you have.
Bias Sources in Herbal Pharmacovigilance
What a Case Report Can and Cannot Establish
| Action | Case Reports CAN | Case Reports CANNOT |
|---|---|---|
| Causation | ✓ Generate a causal hypothesis worth investigating | ✗ Prove that the herb caused the event |
| Incidence | ✓ Suggest a pattern worth monitoring | ✗ Establish how often this occurs per 1000 users |
| Mechanism | ✓ Generate mechanistic hypotheses (esp. with dechallenge data) | ✗ Confirm pharmacological mechanism |
| Pharmacovigilance | ✓ Trigger regulatory review and product surveillance | ✗ Characterize the full risk profile of an herb |
| Market safety | ✓ Signal authentication failures and adulteration patterns in the market | ✗ Distinguish herb risk from product quality failure |
| Clinical guidance | ✓ Identify specific preparation forms or contexts requiring caution | ✗ Support a blanket contraindication for an herb category |
Green Tea Is Hepatotoxic
This is the claim. Work through the analytical layers below to understand how it is simultaneously grounded in real data and systematically misleading — and what a clinically useful answer actually looks like.
Case reports of hepatotoxicity have accumulated in the literature associated with green tea products. FDA, EMA, and several national agencies have issued safety communications. The phrase "green tea hepatotoxicity" appears across the regulatory record and in clinical reference databases. A student searching for herb-drug interactions will find this signal quickly.
Before you accept or reject this claim, you need to interrogate the evidence base for those four variables. A claim that skips this interrogation is not a safety determination — it is pattern recognition dressed as analysis.
The adverse event case reports that anchor "green tea hepatotoxicity" cluster in users of high-dose, concentrated EGCG extract supplements — not in drinkers of aqueous green tea infusion. The category label "green tea" absorbs a risk signal that belongs specifically to one preparation form.
This matters because it changes the risk assessment entirely:
Two variables interact to determine hepatic exposure: total dose and rate of delivery.
The fasting variable is critical. Most weight-loss supplements using GTE are marketed for use on an empty stomach. This is not coincidental — fasted-state use is a major confound in the hepatotoxicity case clusters. Rate of absorption affects whether you exceed a hepatotoxic threshold at all.
Animal hepatotoxicity data for green tea extract exists. The NTP gavage protocol is the most cited. Here is what it actually tells you:
Apply the same interrogation to animal data that you apply to human case reports: What was the dose? What was the route? What was the preparation form? Does this exposure scenario map onto any realistic human use pattern?
Authentication is not a regulatory checkbox. It is a logical precondition for interpreting any adverse event signal: if you do not know what was in the product, the event is unattributable.
Click each element to see what it requires:
Academic writing represents the literature accurately; clinical communication calibrates a client's decision under uncertainty. These are different tasks. "The evidence is complex and further research is needed" is accurate as a peer-review statement. It is a failure mode in a clinical encounter.
Aqueous green tea infusion at normal consumption: no credible hepatotoxicity signal.
Concentrated EGCG supplements, high dose, fasted state: real but rare signal, preparation and dose dependent.
Form of preparation is the actionable variable.
Kratom Overdose Deaths
Case reports of kratom-associated fatalities and overdose events have accumulated in the adverse event database and clinical literature. The same analytical framework that dismantled "green tea is hepatotoxic" applies here — with some important structural differences that make this case more complex.
FDA issued a public health advisory. The CDC database contains hundreds of kratom-associated reports. Several high-profile media cycles have presented kratom overdose deaths as evidence that kratom itself is acutely lethal. The phrase "kratom overdose" appears as if it is a pharmacological category equivalent to "opioid overdose."
A systematic review of the kratom overdose literature (Stanciu et al., 2023) found that the overwhelming majority of fatal cases attributed to kratom involved concomitant substances — predominantly opioids, benzodiazepines, or both. The question of what kratom itself does pharmacologically is distinct from the question of what happens when it is used alongside these agents in a vulnerable population.
This is structurally different from the green tea case. For green tea, the primary attribution error was preparation form. For kratom, the primary confound in the fatal case cluster is polypharmacy.
When applying the causality framework:
- Dechallenge and rechallenge are generally unavailable in fatal cases
- Alternative explanations (the co-ingestants) are frequently present and uncontrolled
- The temporal relationship is often ambiguous when multiple substances are co-ingested
- Toxicological confirmation of other substances was not always conducted or reported
The authentication problem for kratom is structurally identical to green tea — but with a higher-stakes adulterant profile.
The population using kratom in the United States is not representative of the general population, and this matters enormously for interpreting adverse event data.
This is the same structure as the green tea case — but with a higher-stakes version of the confound. If a person using kratom to manage heroin withdrawal dies with kratom alkaloids in their system, the causal question requires knowing:
- Were other opioids or CNS depressants also present?
- Was there relapse during the period of kratom use?
- Was the kratom product authenticated?
- Was the death mechanism consistent with kratom's pharmacology or with an opioid/benzodiazepine toxidrome?
Traditional kratom use in Southeast Asia involves brewing the fresh or dried leaf as a tea, consumed at culturally defined doses within a social and ritual context. This is pharmacologically different from the dominant US use patterns:
Kratom exists in a complicated regulatory and clinical space. It is not a scheduled substance federally (as of 2025), but several states have restricted it. The adverse event signal is real — it is simply not distributed where the media coverage implies it is.
Traditional/low-dose leaf preparation, no CNS depressant co-use: limited credible acute fatality signal from kratom pharmacology alone.
Extract products or polypharmacy context: real and serious risk, preparation/context dependent.
Adulterated product: the risk is the adulterant — authentication is non-negotiable before any risk estimate.
Form, context, and product quality are the actionable variables.
Green Tea vs. Kratom: Structural Parallels
The same analytical architecture applies to both cases. The variables differ; the framework is identical.
| Variable | Green Tea | Kratom |
|---|---|---|
| Primary attribution error | Preparation form: EGCG extract risk attributed to "green tea" category | Polypharmacy: co-ingestant risk attributed to "kratom" category |
| Authentication confound | Adulteration, solvent residuals, IPPSC failure in case-report products | Synthetic opioid adulteration (including fentanyl) in kratom products |
| Dose-form gradient | Tea → extract with food → extract fasted (increasing risk) | Leaf tea → powder → extract (increasing concentration and dose uncertainty) |
| Population confound | Weight-loss supplement users have different health profiles from tea drinkers | Opioid withdrawal self-medication context; baseline risk is elevated independent of kratom |
| Animal data issue | Gavage, fasted, suprapharmacological doses; poor translation to human infusion | Rodent alkaloid studies; route and dose problems; MOR binding confirmed but overdose mechanism in humans involves polypharmacy context |
| Denominator problem | Billions of daily tea drinkers; supplement users much smaller but poorly characterized | Total US kratom users poorly characterized; fatal cases not adjusted for exposure prevalence |
| Signal assessment | Real signal in EGCG extract + fasted context; near-zero signal for aqueous infusion | Real signal in extract + polypharmacy context; signal in traditional use context is limited and poorly separated from confounders |
| Clinical bottom line | Preparation form is the actionable variable | Preparation form + co-ingestion context are both actionable variables |
Portable Checklist
Apply these eight questions to any claim that an herb is dangerous. They are not a shortcut — they are the minimum analytical standard. Click each item as you work through it.
(1) What does the evidence actually implicate — herb category, preparation form, or specific use context?
(2) Were causality and authentication standards met in that evidence?
(3) Does your client's actual use pattern match the implicated scenario?
Only after answering all three can you make a clinically useful risk statement.
Framework grounded in: Barnes (2003); Meyboom et al. (1997); Edwards (2017); Théophile et al. (2010, 2012); Agbabiaka et al. (2008); Stanciu et al. (2023); EFSA/EMA GTE assessments; WHO-UMC causality criteria. IPPSC as applied in botanical pharmacovigilance per USP and regulatory guidance.