
Human kind?
The first check before every decision.
For centuries, we've made decisions through four filters: cost, time, convenience, risk. HI Grade adds the missing one β verified human impact.
Human kind? Now you can find out.
β’ Mobile App FREE




The HI Gradeβ’ in your pocket. Available free on iOS. Search, Siri, Widget, Shortcuts.
Four ways to use HI Grade without opening the app.
On the roadmap. Not yet built.
β’ APIIntegrate HI Gradesβ’ into procurement platforms, investment tools, ESG dashboards, and consumer products.
700+ companies Β· 42 data sources Β· 32 endpoints
HUMAN Heartbeat Β· HUMAN Genome Β· HUMAN 100 Indexβ’ Β· HUMAN Lens Β· Real-time decay detection
Free to use Β· No API key required Β· 100 calls/day
All 32 endpoints are free and open. Sufficient for most journalism, research, and prototyping.
Real-time score decay detection.
The HUMAN Heartbeat monitors 42 data sources for signals that a company's HI Grade may be about to change. It tracks layoff surges, AI acceleration pivots, ethics controversies, CEO accountability issues, and environmental incidents.
Each company gets a Decay Index from 0 to 100. Companies above 30 are flagged as warnings. Above 50 is critical.
The top 100 most human public companies.
ETF-licensable. Rebalanced monthly. Methodology: HUMAN Grade Spec v1.2.0.
The HUMAN 100 Backtest tracks whether companies that score highest on human consciousness, empathy, ethics, environmental care, and transparency also deliver better returns.
Daily score snapshots and price data are being collected. The backtest will go live once we have 30+ trading days of validated data.
The thesis: Companies that treat humans well should outperform companies that don't. The HUMAN 100 Backtest measures that hypothesis daily with real market data. No cherry-picking. No survivorship bias. Just the math.
Five things ESG was never built to see.
ESG was designed for investors managing risk. HI Grade was built to measure humanity in companies. They examine different things, ask different questions, and sometimes reach very different conclusions about the same company.
AI Balance β does the company use AI to augment humans, or to replace them?
Companies with deep craft, genuine empathy, principled leadership, and transparency have moats that AI cannot cross.
Side-by-side HI Gradeβ’ comparison. See where the balance differs.
How one company's ethics ripple through an entire industry.
When a market leader cuts thousands of jobs, the whole industry feels pressure to follow. When a company invests in human craft, competitors have to match it or look bad. HUMAN Contagion measures this ripple effect β how individual company behavior spreads across industries.
Each company's contagion score is weighted by market influence, headcount, and industry position. A company with high contagion and low HI Grade is dragging its entire industry down. A company with high contagion and high HI Grade is lifting everyone up.
Detecting real empathy vs. performative empathy.
A company scores 100/100 on the HRC Corporate Equality Index. It publishes a beautiful DEI report. It wins awards for inclusion. But its Glassdoor rating is 2.8 and employees say management doesn't care. That's performative empathy β and the HUMAN Watermark detects it.
The watermark cross-references external signals (DEI scores, HRC ratings, CDP disclosures) against internal reality (employee reviews, CEO accountability, headcount trends). When the gap is wide, the empathy isn't real.
Your personal ethical footprint. How conscious are your choices?
Every company you buy from, subscribe to, or work for has a HI Grade. Your portfolio of companies creates a personal Consciousness Score β a measure of how human your daily footprint is.
Three tiers: Highly Conscious (avg β₯ 60), Conscious (avg 40β59), and Unaware (avg < 40).
What if you use these 5 companies daily?
Collective market pressure. Where the wave is building.
Individual HI Grades tell you about one company. The HUMAN Wave tells you about all of them. It aggregates scoring data across industries and dimensions to detect where market pressure is building β which sectors are under the most scrutiny, which dimensions are failing system-wide, and where consumer consciousness is creating unstoppable momentum for change.
When enough companies in an industry score poorly on the same dimension, the Wave signals that the entire industry needs to shift. Companies can't ignore collective pressure.
When humanity erodes. Combined warning signals from Heartbeat, Shield, Contagion, Lens, and Watermark.
HUMAN Decline aggregates five orthogonal warning systems into one profile per company:
Every company's unique sub-signal fingerprint. Click any company to see what drives their score.
The HUMAN Genome breaks each dimension into individual measurements β Workforce Valuation, Craft, Stakeholder Governance, Energy & Emissions, and more. It's the DNA of a company's humanity. No two genomes are the same.
Click a company to explore its full genome on the detail page.

Human kind?
For centuries, we've made decisions through four filters: cost, time, convenience, risk. HI Grade adds the missing one β verified human impact.
Human kind? Now you can find out.
started on Earth Day 2025 with a simple question: if AI is changing everything, who's measuring what we're losing?
I'd spent years in technology β enough to know what AI could do, and enough to worry about what it was replacing. Not just jobs. The craft behind the work. The empathy in the service. The humans in the loop.
I couldn't find a tool that measured any of that. So I built one. The methodology is open source because a transparency framework that hides its own math would be hypocritical. The name is
because it's both a greeting and a question: is it Human Intelligence, or Artificial?
Don't panic. Every journey starts somewhere. The data will get better. The ladders will get grounded. The companies will adapt. That's the point.

We'd love to hear from you.
See what drives your score across all five HUMAN dimensions. Want Gold? We'll share your current gates and what it takes.
Earn Gold HI Grade β license the badge for your website, packaging, and job listings.
Don't see a company you care about? Tell us. Have feedback? Ideas? Questions? We're humans too.
Industry reports, API licensing, research partnerships.
The math behind being human kind. Spec v1.2.1 Β· May 2026
Every company is scored from 0 to 100. The score is the average of five HUMAN dimensions, each scored 0β100. Higher means more human. The Balanced Board highlights companies where all five dimensions score β₯ 60.
HI Grade scores companies across five dimensions. Together they spell HUMAN:
Every score on thehibalance.org traces back to these formulas. 19 active sub-signals. 5 deferred for v1.2. Open the table for any dimension to see the math.
| ID | Name | Signal | Source | Status |
|---|---|---|---|---|
| H.1 | Workforce Valuation | Layoffs, WARN filings, revenue-per-employee ratio | WARN, SEC 10-K, BLS | PARTIAL |
| H.2 | Craft | Apprenticeship, training investment, craftsmanship signals | Glassdoor, Jobs, BLS | UNGROUNDED |
| H.3 | Human Decision Depth | AI-disclosed roles vs human roles, decision authority | SEC 10-K, Job Boards | UNGROUNDED |
| H.4 | Leadership Continuity | CEO turnover, board stability | β | DEFERRED |
| H.5 | Human Augmentation Index | AI as tool vs AI as replacement β measured by role composition | SEC filings, job boards | UNGROUNDED |
| ID | Name | Signal | Source | Status |
|---|---|---|---|---|
| U.1 | Relational Integrity | Customer complaint volume, complaint response rate | CFPB, FTC Sentinel | PARTIAL |
| U.2 | Simulated Empathy Detection | Chatbot vs human support mix, escalation paths | Glassdoor, complaints | UNGROUNDED |
| U.3 | Culture Authenticity | Glassdoor trajectory, CEO approval vs recommend gap | Glassdoor | PARTIAL |
| U.4 | Inclusion Signal | HRC CEI score, DEI programs, accessibility | HRC, DEI disclosures | PARTIAL |
| U.5 | Community Investment Depth | Philanthropy vs PR measure | β | DEFERRED |
| ID | Name | Signal | Source | Status |
|---|---|---|---|---|
| M.1 | Principled Action | Regulatory violations, SEC enforcement | SEC, OSHA, EPA ECHO | PARTIAL |
| M.2 | Accountability Structure | Board independence, executive pay ratio | SEC DEF 14A | UNGROUNDED |
| M.3 | Workplace Safety | OSHA incident rate, fatality rate vs industry | OSHA via DOL | PARTIAL |
| M.4 | Optimization Harm | Algorithmic harm, dark pattern detection, AHIβ’ flags | Public reporting, FTC | UNGROUNDED |
| M.5 | CEO Accountability | Compensation-to-median-worker ratio, public commitments | SEC DEF 14A | UNGROUNDED |
| ID | Name | Signal | Source | Status |
|---|---|---|---|---|
| A.1 | Climate Integrity | CDP score, Scope 1+2+3 disclosures, trajectory | CDP, SEC Climate | PARTIAL |
| A.2 | Pollution & Compliance | EPA ECHO violations, remediation history | EPA ECHO | PARTIAL |
| A.3 | Land & Water | Industry baseline, USDA Organic, water stress | Industry, EPA, USDA | UNGROUNDED |
| A.4 | AI Infrastructure Cost | Data center power, water for compute, AI emissions | SEC, public disclosures | UNGROUNDED |
| ID | Name | Signal | Source | Status |
|---|---|---|---|---|
| N.1 | Disclosure Quality | Readability vs legalese in material filings | β | DEFERRED |
| N.2 | Humanwashing Detection | Revenue-per-employee anomalies, AI-replaces mismatches | SEC 10-K, jobs data | UNGROUNDED |
| N.3 | Proactive Disclosure | Voluntary reporting beyond legal minimum | β | DEFERRED |
| N.4 | Material Event Timing | Time-to-disclosure for material events | β | DEFERRED |
| N.5 | Filing Volume | 8-K frequency, 10-K completeness, amendment rate | SEC EDGAR | GROUNDED |
HI Grade applies three harm-detection systems, all flowing into dimension scores (no separate gates):
HD does not penalize companies for selling products that consumers knowingly choose: sugary beverages, alcohol, gambling, unflavored tobacco. These products may cause harm, but the harm flows from informed consumer choice. HI Grade is not the consumer's parent.
HD does penalize when consent was not possible:
The composite score is the simple mean of the five HUMAN dimensions, with one floor rule:
If any HUMAN dimension scores below 42, the composite is capped at 50.
This protects against severe single-dimension failure being averaged away by strong scores in other dimensions. A company cannot earn a composite above 50 if even one HUMAN dimension is in critical failure (< 42), regardless of how the other four perform.
When the floor fires:
composite is capped at 50 (or kept at the natural mean if already β€ 50)floor_triggered: true in the API responsetriggering_dimension indicates which dimension caused the cap (H/U/M/A/N)This rule replaces a multi-tier floor system used in earlier specs (any dim < 10 β cap 40, one dim < 42 β cap 49, two+ dims < 42 β cap 41), simplified to one clear, defensible threshold.
Examples:
Note: sub-signal scores below 42 do not trigger the floor. Only dimension-level scores (D_H, D_U, D_M, D_A, D_N) count. Sub-signals are component inputs; the dimension is what matters for floor evaluation.
The Balanced Board highlights companies with no weak HUMAN dimension. Earned algorithmically, never purchased. A company makes the Balanced Board when it meets all three criteria:
A single weighted composite β the approach most ESG frameworks use β averages away disqualifying signals. A company can be excellent on four dimensions and ethically compromised on the fifth, and still rate highly. HI Grade refuses to do this. Each dimension is an independent check.
When Oracle executed mass layoffs in April 2026 affecting an estimated 25,000+ workers, their backward-looking SEC filings still showed the pre-layoff workforce. A scoring system reading only last-quarter's 10-K would have given Oracle a high H score the day thousands of workers were locked out of internal systems at 6 a.m. EST. HI Grade's momentum gate catches this in real time by monitoring news, 8-K filings, WARN Act notices, and other decay signals across a 90-day rolling window.
Every score passes through three validation layers. They exist to prevent any single source β including bad data, gaming attempts, or short-term news cycles β from distorting the math.
Before any data enters scoring, it's checked for sanity. Negative headcounts, impossible Glassdoor ratings (above 5.0 or below 1.0), revenue-per-employee anomalies, and other physically impossible inputs are rejected at ingestion. The pipeline logs the rejection and continues with verified inputs.
After scoring, results are checked for stability and shape. Sub-signal scores must fall within their defined ranges. Composite scores are checked against historical stability β large unexplained moves trigger manual review. Distribution shape is monitored β if every company suddenly scored 70+, something broke. Known leaders (companies on the Balanced Board for 6+ months) act as sentinels: sudden drops trigger investigation.
No single source can move a sub-signal score by more than 15 points. If one CFPB report would change U.1 by 30 points, the impact is capped at 15. The remaining movement requires a corroborating source.
This prevents a single bad data point β or a coordinated gaming attempt β from poisoning a score. The cap exists because we trust no source completely, including ourselves.
Every HI Grade is reconstructable from public data. The API returns every sub-signal value, every source, every penalty applied. No black boxes.
The response contains the full breakdown:
division / addiction / manipulation / transparency / human_overrideThe entire pipeline source is on GitHub under Apache 2.0. Run it yourself. Get the same answer. That's the point.
HI Grade is estimated from public data. Sometimes public data is wrong, or missing, or misattributed. When that happens, the score is wrong. We fix it.
[SCORE-CHALLENGE] TICKER and include: sub-signal, current value, proposed value, citation URL.What we don't do: remove unflattering scores on request. A score based on verified public data stays. If you disagree with the methodology, the source is open β propose a better rubric.
Found a security vulnerability? Don't open a public GitHub issue. Email hi@thehibalance.org with subject [SECURITY] and a description of the issue.
Full scope, acknowledgement timelines, and what we care about: SECURITY.md on GitHub.
A transparency framework that hides its own limitations is hypocritical. Every limitation below is tracked and has a path forward. The math decides β and when the math is incomplete, we say so.
This is the dominant pattern in v1.2.0. Most scoring uses authoritative data (SEC, CFPB, OSHA, EPA, CDP) with editorial thresholds. The data is ground truth; the score bands (e.g., <100 CFPB complaints per $B = 85 points) were chosen by engine authors.
Path forward: grounding thresholds against B Corp quintiles, CFPB published distributions, BLS industry medians.
Five sub-signals (H.4 Decision Authority, U.5 Moral Courage, N.1 Reporting Quality, N.3 Subsidiary Transparency, N.4 Supply Chain) are spec'd but deferred to v1.3 because the data ladders aren't ready.
Path forward: v1.3 target. Each requires either a new data source or a new scoring ladder.
CFPB complaints power U.1 (Customer Empathy) and M.1 (Pricing Ethics). Banks, credit cards, mortgages, debt collectors are covered well. A coffee company isn't. For ~80% of companies we score, CFPB returns zero β not because they're saints, but because CFPB doesn't regulate that sector.
Path forward: BBB complaints (now ingested), FTC actions (already scoring), CPSC SaferProducts (already scoring). Per-sector regulators being audited.
iFixit publishes detailed repairability scores for consumer electronics β Apple, Samsung, Dell, HP, Microsoft Surface, ~10 others. For the other 700+ companies, A.4 (Product Lifecycle) falls back to industry defaults.
Path forward: EU Extended Producer Responsibility datasets publish repairability scores. Ingestion planned for v1.2.
Harm Documentation covers 14 categories with strong coverage from 2020 onward. For historical events β tobacco, asbestos, J&J talc β we maintain a curated Major Harm Events dictionary encoding court settlements. Confident in the dictionary, but incomplete.
Path forward: systematic backfill from EPA Superfund site attributions, state AG settlement databases, academic attribution studies.
Many sub-signals normalize by industry. Medians were derived from aggregated SEC data but are currently hardcoded constants. Last refresh Q1 2026; next scheduled Q3 2026.
When per-company SEC workforce data is unavailable, sub-signals fall back to these industry medians. This affects roughly 30-40% of the S&P 500 universe today and means companies in the same industry can show identical H sub-signals β e.g., Coca-Cola and Starbucks both default to consumer-goods medians until broader workforce data sources are integrated. We disclose rather than mask the fallback; expanding per-company H signal coverage is on the v1.2.1 roadmap.
Path forward: dynamic industry median computation from the live company universe. Technical work planned for v1.3.
v1.0.2 renormalization note: H.4 (CEO Accountability) was removed from the H-dimension calculation. CEO accountability is now reflected in the Heartbeat decay system and in M.4 (Product Ethics). Remaining H sub-signals (H.1, H.2, H.3, H.5) were proportionally rescaled to sum to 1.0.
D_N weights note: D_N (Natural Transparency) currently uses equal 0.5/0.5 weights between N.2 (Reporting Quality) and N.5 (Filing Volume). Sub-signals N.1, N.3, N.4 are deferred to v1.3. Additional grounded signals (DSA transparency, 12b-25 late-filings) are on the v1.3 roadmap.
The Decay Index aggregates layoff mentions, 8-K filings, WARN notices, CEO accountability into 0-100. Inputs are authoritative; thresholds mapping to Stable / Watch / Warning / Critical are editorial.
Path forward: validating against known decay events (Oracle 2026, Twitter 2022, Meta metaverse pivot). Ongoing.
HI Grade uses no AI in the scoring pipeline. No large language models summarizing reports. No neural networks classifying sentiment. No machine learning inferring "good" or "bad." Every score is a deterministic computation on public data.
We do use AI to help build the system β write Python, draft documentation, test edge cases. We don't use AI to judge any company. The distinction matters: LLMs are excellent for engineering work and explanation, and untrustworthy for the kind of consistent, auditable, high-stakes judgments ESG ratings require.
The entire value proposition of HI Grade is reproducibility and trust. An LLM-generated score can't be reproduced exactly, can't be audited line-by-line, and can't defend itself against "why did you rate us that way?" in a regulatory or legal context. Math and data can.
42 public sources. Zero AI. Zero black boxes. Run it yourself.
Refreshed nightly. Authoritative regulatory data.
Live API. Industry benchmarks and corporate intelligence.
Quarterly refresh. Specialized public records.
Quarterly manual review. Third-party verified standards.
Derived from above sources or curated public records.