What is a ForIntel Research Field Atlas?

A Research Field Atlas is a structural map of how a research field is built. It reads four field vectors directly against the scholarly-publication record — concept-cluster structure (how the field divides into research clusters), citation-magnitude ranking (where the field's citation mass concentrates), institutional concentration (which institutions produce the field's most-cited output), and time evolution (when the field formed, how it has matured, and whether it is still active) — and turns that into where a funder, department head or analyst should place reading, scouting and partnership attention. It maps field structure, not scientific merit: it does not adjudicate whether any claim is correct, rank therapies by efficacy, or make a recommendation. Every count traces to a retrieved record, capped pulls are reported as floors rather than rounded up, and the highest-stakes claim is independently re-verified against a second corpus.

What did this sample find about the COVID-19 vaccines-and-variants field?

The field is a concentrated structure. On concept structure, a corpus of 595 distinct works concentrates in a single core research cluster of 371 works, with a long specialist tail: clinical research (48), epidemiology (26), mental health (22), long-term effects (17), plus vaccine coverage/hesitancy and computational drug discovery, for which no counts are reported. On citation magnitude, the field's gravity wells are a tight early-2020 founding cohort: the clinical-characteristics and novel-coronavirus-identification papers (~31,000 and ~30,000 citations), the coronavirus-origin and ACE2/TMPRSS2 cell-entry virology (~23,000 and ~21,000), and the pivotal BNT162b2 and mRNA-1273 vaccine-efficacy reports (~15,500 and ~10,700). On institutional structure, a compact UK-and-China-led core dominates: University of Oxford (49 works), University College London (31), University of Hong Kong (29), Cambridge, Harvard and Imperial (22 each), and the LSHTM (21), with the Chinese Academy of Sciences carrying an outsized citation footprint relative to its work count. On time evolution, a 2020 founding burst of 213 works has consolidated year over year (158 in 2021, 94 in 2022) while the field keeps publishing. The read: a mature, consolidating field deepening around its founding cohort, with the leverage at the specialist edges.

How do I commission a Research Field Atlas?

Contact the ForIntel desk at forintel@foragentis.com or scope an engagement on the ForIntel order page at foragentis.com/forintel#order. The Atlas tier delivers the four-vector structural map shown in this sample. A deeper engagement adds the directed citation-graph layer to identify the bridge researchers who connect the vaccine and variant sub-areas and the citation-co-occurrence linkage between clusters; the patent IP-translation read once the patent-record credential is provisioned; and an uncapped, funder-metadata-bearing corpus pull to quantify the full field and add the funding-flow layer.

Sample ForIntel Research Field Atlas — How the COVID-19 Vaccines-and-Variants Field Is Built

What data and method does the Atlas use, and does it judge scientific merit?

The Atlas is built on the scholarly-publication record — a concept-anchored corpus clustered on each work's concept tag, ranked by citation magnitude, aggregated by producing institution, and counted by publication year — cross-checked against a biomedical-literature corpus, with the highest-stakes claim (the field's continued publication velocity) independently re-verified against that second corpus. It does not judge scientific merit. It reports field structure only: it does not adjudicate whether any vaccine or variant claim is correct, rank therapies by efficacy, or make a clinical recommendation. Citation magnitude is treated as a structural signal — where attention concentrates — not as a statement that any work is scientifically correct. The corpus pulls were capped, so every count (the 595-work corpus, the cluster sizes, the institutional work counts, the per-year volumes) is reported as a floor, and the load-bearing reads are the shapes of the concentration rather than exact censuses.

What this sample shows

This is a public sample of a ForIntel Research Field Atlas deliverable, published by Foragentis to demonstrate the method. It is an academic field map of the COVID-19 vaccines-and-variants research field — there is no private buyer to redact. Every institution and concept named here is public scholarly information, named in full; the structural findings describe how a public research field is organized, not anyone's internals.

It demonstrates what the Atlas reads: a structural map of a research field across four directly observed field vectors (concept-cluster structure, citation-magnitude ranking, institutional concentration, and time evolution), each carrying an explicit confidence label, with the highest-stakes claim independently re-verified against a second corpus, and with the deliberate boundaries named rather than glossed over. Above all, this is a map of field structure, not scientific merit: it reports how the field is organized and where to place attention, and makes no scientific, clinical or efficacy claim about COVID vaccines or variants. Any scientific interpretation is the reader's, with domain experts.


Field	COVID-19 vaccines & variants research · concept-anchored corpus
Scope	Field-structure atlas · concept / magnitude / institution / evolution
Window	2020 onward (founding-era structure) · trailing-24-month velocity · Jun 2026
Method	Scholarly-publication record (concept clusters, citation magnitude, institutional structure, time evolution) + biomedical-literature cross-corpus, with the highest-stakes claim independently re-verified
Prepared by	ForIntel by Foragentis

The verdict

The COVID-19 vaccines-and-variants field is a concentrated structure: one dominant research cluster, a citation mass anchored on an early-2020 founding cohort, a compact UK-and-China-led institutional core — mature, consolidating, and still publishing.

This is a structural map of the field across four vectors. On concept structure, the field corpus — 595 distinct works — concentrates in a single core research cluster of 371 works, with a long tail of specialist clusters (clinical research, epidemiology, mental-health, long-term effects, vaccine coverage/hesitancy, computational drug discovery). On citation magnitude, the field's gravity wells are a tight early-2020 founding cohort — the clinical-characterization papers, the novel-coronavirus virology work, and the pivotal vaccine-efficacy reports — carrying tens of thousands of citations each. On institutional structure, a compact set led by Oxford, UCL, Hong Kong, Imperial, Harvard, Cambridge and the LSHTM produces a disproportionate share of the field's most-cited output. On time evolution, the field is a 2020 founding burst of 213 works that has consolidated year over year while continuing to publish. The actionable structure is not whether the field is active — it plainly is — but where the centre of gravity and the specialist gaps sit, so a funder, department head or analyst can place reading, scouting and partnership attention deliberately.

The field concentrates in one dominant research cluster, with a long specialist tail. The scholarly-publication corpus — 595 distinct field works — organizes into a single core research cluster of 371 works, with a long tail of adjacent specialist clusters: clinical-research studies (48), epidemiology (26), mental-health research (22), long-term-effects research (17), vaccine coverage / hesitancy, and computational drug discovery. The structural read: the field has a large, tightly-held centre of gravity with specialist work organized around it, not a set of evenly-sized sub-fields. The corpus pulls were capped, so every count is a floor — the read is the cluster structure, not an exact census.
The citation mass concentrates on an early-2020 founding cohort. Ranked by citation magnitude (how heavily each work is cited), the field's foundational layer is a tight early-2020 cohort: the clinical-characteristics report and the novel-coronavirus identification paper (~31,000 and ~30,000 citations), the coronavirus-origin and cell-entry-mechanism virology work (~23,000 and ~21,000), and the pivotal vaccine-efficacy reports (the BNT162b2 and mRNA-1273 trials, ~15,500 and ~10,700). These are the field's gravity wells, the works where attention concentrates. Citation magnitude is a structural signal, not a statement that any work is scientifically correct.
A small set of institutions produces a disproportionate share of the field. Aggregating the field corpus by producing institution (counts are floors), a compact leadership set dominates: University of Oxford (49 works), University College London (31), University of Hong Kong (29), University of Cambridge (22), Harvard University (22), Imperial College London (22) and the London School of Hygiene & Tropical Medicine (21), with the Chinese Academy of Sciences carrying an outsized citation footprint relative to its work count. The structural read: a UK-and-China-anchored institutional core produces the field's most-cited output — the natural places to scout talent or seed partnership.
The field is a 2020 founding burst that consolidates — and is still publishing. By publication date the field's mass concentrates in a 2020 founding burst of 213 works that consolidates year over year (158 in 2021, 94 in 2022) — the canonical pandemic-research arc of an explosive founding cohort the field keeps building on. A separate current-activity confirmation shows the field is still actively publishing (200 works dated 2026 in the recency-sorted pull), so the consolidation is a maturation of the foundational cohort, not a cooling of the field. The recent biomedical-literature velocity corroborates continued output in both the vaccine and variant streams.

In one line: The COVID-19 vaccines-and-variants field is a concentrated structure: one dominant research cluster of 371 works inside a 595-work corpus, a citation mass anchored on an early-2020 founding cohort, a compact UK-and-China-led institutional core, and a 2020 founding burst that has consolidated while the field keeps publishing. The actionable question is not 'is the field active' (it is) but where the centre of gravity sits and which specialist clusters and institutions are the highest-leverage targets for reading, scouting or partnership attention.

How to read this report. This is a field-structure map — it reports how the COVID-19 vaccines-and-variants research field is organized (concept clusters, citation magnitude, institutional concentration, time evolution) and turns that into where to place field attention. It is not a scientific or clinical assessment: it does not adjudicate whether any vaccine or variant claim is correct, rank therapies by efficacy, or make a medical recommendation — any scientific interpretation is the reader's, with domain experts. A High confidence label marks a directly observed structural signal. Every quantitative claim traces to a specific retrieved record; capped pulls are reported as floors, never rounded up. One structural layer — the directed citation graph (the who-cites-whom edges that would let us name bridge researchers and the citation-co-occurrence linkage between clusters) — is deferred as an operationally-heavy enhancement and named honestly in the closing section; citation magnitude is fully present, only the directed edges are deferred. The highest-stakes claim — the field's continued publication velocity — was independently re-verified against a second corpus.
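The floor convention described above can be illustrated with a minimal sketch. Everything named here is an assumption made for illustration (the `Count` type, the `count_from_pull` helper, and the cap of 200 are hypothetical and do not describe ForIntel tooling). The idea it shows is that any count derived from a pull that hit its cap inherits floor status, including sub-counts such as cluster sizes.

```python
from dataclasses import dataclass
from typing import Callable, Sequence


@dataclass(frozen=True)
class Count:
    value: int
    is_floor: bool  # True when the underlying pull hit its cap

    def __str__(self) -> str:
        # A floor is printed as ">=N" so it is never mistaken for a census.
        return f">={self.value}" if self.is_floor else str(self.value)


def count_from_pull(
    records: Sequence[object],
    cap: int,
    predicate: Callable[[object], bool] = lambda record: True,
) -> Count:
    """Count matching records; mark the result a floor if the pull was capped."""
    capped = len(records) >= cap
    matched = sum(1 for record in records if predicate(record))
    return Count(matched, is_floor=capped)


# Placeholder records only; the cap of 200 is an assumed example value.
full_pull = list(range(200))
partial_pull = list(range(37))

print(count_from_pull(full_pull, cap=200))                                   # >=200
print(count_from_pull(full_pull, cap=200, predicate=lambda r: r % 2 == 0))   # >=100
print(count_from_pull(partial_pull, cap=200))                                # 37
```

Carrying the floor flag with the number, instead of in a footnote, is one way to keep a capped figure from being rounded up or quoted as exact downstream.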

01 · Concept-cluster structure — one centre, or many?

(Confidence: High.) The first structural question is how the field divides into research clusters — because where the work concentrates tells a reader where the field's centre of gravity is and where the specialist edges sit. Anchored on the field's core concepts and deduplicated to distinct works, the corpus is 595 distinct field works, and it does not divide evenly. One core research cluster of 371 works — the central SARS-CoV-2-and-COVID-19 research body — dominates, with a long tail of adjacent specialist clusters organized around it. (Source: the scholarly-publication record, concept-anchored corpus, clustered on each work's concept tag.)

Concept cluster	Works (floor)	Position in the field
Core SARS-CoV-2 / COVID-19 research	371	Dominant centre of gravity
Clinical-research studies	48	Largest specialist cluster
Epidemiology	26	Specialist tail
Mental-health research	22	Specialist tail
Long-term-effects research	17	Specialist tail
Vaccine coverage / hesitancy · computational drug discovery	—	Specialist tail (capped)
Field corpus total	595	Distinct works (floor)

Figure — One dominant research cluster, with a long specialist tail (field corpus · works by concept cluster · 595 distinct works). The scholarly-publication corpus concentrates in a single core research cluster of 371 works — several times the size of any specialist cluster — with a long tail of specialist clusters around it. The pulls were capped, so every count is a floor; the load-bearing read is the cluster structure, not an exact census.

Two structural reads matter. First, the field has a large, tightly-held centre: the core cluster is more than seven times the size of the largest specialist cluster (371 works against 48), so the typical work sits in the mainstream of the field rather than in a niche. Second, the specialist tail is where the field's distinct sub-conversations live: clinical-research methodology, epidemiological modelling, the mental-health and long-term-effects literatures, and the vaccine-coverage / hesitancy and computational-drug-discovery clusters. For a reader deciding where to read or seed work, the centre is well-covered and crowded; the specialist clusters are the more legible places to find an under-attended edge. The whole map is built from capped pulls, so the cluster sizes are floors. The load-bearing read is the shape of the concentration (one dominant cluster, a long tail), which a deeper, uncapped corpus would sharpen but not overturn.
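The concentration in the table above can be expressed as a few simple ratios. The sketch below uses only the counts printed in the table. It assumes each work carries a single cluster tag, so the sized clusters do not overlap, and because the counts are floors the outputs are indicative, not exact.

```python
# Cluster sizes copied from the table above (floors from capped pulls).
cluster_sizes = {
    "Core SARS-CoV-2 / COVID-19 research": 371,
    "Clinical-research studies": 48,
    "Epidemiology": 26,
    "Mental-health research": 22,
    "Long-term-effects research": 17,
}
corpus_total = 595  # distinct works (floor)

core_name, core_size = max(cluster_sizes.items(), key=lambda kv: kv[1])
largest_specialist = max(
    size for name, size in cluster_sizes.items() if name != core_name
)

core_share = core_size / corpus_total
core_to_specialist = core_size / largest_specialist
# Works in clusters the table names but does not size (assumes no overlap).
unsized_remainder = corpus_total - sum(cluster_sizes.values())

print(f"core share of corpus: {core_share:.1%}")                  # 62.4%
print(f"core vs largest specialist: {core_to_specialist:.1f}x")   # 7.7x
print(f"works outside the sized clusters: {unsized_remainder}")   # 111
```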

02 · Citation-magnitude ranking — where the field's citation mass concentrates

(Confidence: High.) The second structural question is which works the field is built on — its foundational layer. We read this through citation magnitude: how heavily each work is cited, supplied inline with every record. The field's most-cited works are a tight, coherent early-2020 founding cohort. (Source: the scholarly-publication record, ranked by citation magnitude.)

The clinical-characterization papers anchor the field. The two most-cited works — the early clinical-characteristics-of-COVID-19 report (~31,000 citations) and the novel-coronavirus-from-pneumonia-patients identification (~30,000) — are the clinical and identification foundations the rest of the field builds on.
The virology cohort is the mechanistic spine. The pneumonia-outbreak / coronavirus-origin work (~23,000) and the SARS-CoV-2 cell-entry-via-ACE2/TMPRSS2 mechanism paper (~21,000) carry the field's mechanistic foundation, alongside the early transmission-dynamics and outbreak-lessons reports.
The pivotal vaccine-efficacy reports are the vaccine sub-area's gravity wells. The BNT162b2 (~15,500) and mRNA-1273 (~10,700) trial reports are the most-cited vaccine works — the foundational layer of the vaccine sub-conversation specifically.

Foundational work	Citation magnitude (approx.)	Layer
Clinical-characteristics-of-COVID-19 report	~31,000	Clinical characterization
Novel-coronavirus-from-pneumonia-patients identification	~30,000	Clinical / identification
Pneumonia-outbreak / coronavirus-origin work	~23,000	Virology
SARS-CoV-2 cell-entry via ACE2/TMPRSS2 mechanism	~21,000	Virology
BNT162b2 vaccine-efficacy trial report	~15,500	Vaccine efficacy
mRNA-1273 vaccine-efficacy trial report	~10,700	Vaccine efficacy

Figure — The field's gravity wells — where the citation mass concentrates (field corpus · most-cited works by citation magnitude). The foundational layer is the early-2020 clinical-characterization and virology cohort, joined by the pivotal vaccine-efficacy reports, each carrying citation magnitude an order of magnitude above the typical work. Citation magnitude is a field-structure signal — where attention concentrates — not a statement about the scientific merit or correctness of any individual work.

The structural read is the concentration: the field's citation mass is held by a small early-2020 cohort, an order of magnitude above the typical work, and that cohort spans both sub-areas — the clinical / virology foundation and the pivotal vaccine-efficacy reports. For a reader, these are the works a newcomer to the field must read first and the lineages most downstream work descends from. The essential discipline: citation magnitude measures attention, not correctness — a heavily-cited work is a structural landmark, not an endorsed scientific conclusion, and this Atlas makes no claim about whether any of these works is right. One layer is deliberately deferred here: the directed who-cites-whom edges that would let us trace exactly which later works cite which foundational work — the citation lineage — are an operationally-heavy enhancement named in the closing section; the magnitude read above is complete.
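As a small illustration of the magnitude read, the sketch below ranks the six works from the table above and totals their citations by layer. The figures are the approximate values printed in the table, so the totals are approximate too, and they measure attention only.

```python
from collections import defaultdict

# (work, approximate citations, layer) copied from the table above.
works = [
    ("Clinical-characteristics-of-COVID-19 report", 31_000, "Clinical characterization"),
    ("Novel-coronavirus-from-pneumonia-patients identification", 30_000, "Clinical / identification"),
    ("Pneumonia-outbreak / coronavirus-origin work", 23_000, "Virology"),
    ("SARS-CoV-2 cell-entry via ACE2/TMPRSS2 mechanism", 21_000, "Virology"),
    ("BNT162b2 vaccine-efficacy trial report", 15_500, "Vaccine efficacy"),
    ("mRNA-1273 vaccine-efficacy trial report", 10_700, "Vaccine efficacy"),
]

# Rank by citation magnitude, highest first.
ranked = sorted(works, key=lambda work: work[1], reverse=True)

# Total the approximate citations held by each layer.
citations_by_layer = defaultdict(int)
for _title, citations, layer in works:
    citations_by_layer[layer] += citations

listed_total = sum(citations_by_layer.values())

for title, citations, layer in ranked:
    print(f"{citations:>7,}  {layer:<26} {title}")
for layer, total in citations_by_layer.items():
    print(f"{layer}: ~{total:,} ({total / listed_total:.0%} of the six listed works)")
```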

03 · Institutional concentration — which institutions produce the field

(Confidence: High.) The third structural question is who produces the field — because the institutional concentration tells a funder or a department head where the talent and the most-cited output sit. Aggregating the field corpus by producing institution, a compact leadership set carries a disproportionate share of the field's output. (Source: the scholarly-publication record, field corpus aggregated by producing institution, with each institution's cumulative citation footprint.)

A UK-anchored academic core leads on output. University of Oxford (49 works) leads, followed by University College London (31), University of Cambridge (22), Imperial College London (22), the London School of Hygiene & Tropical Medicine (21) and King's College London (14) — a strikingly UK-weighted production core. (Work counts are floors on the field corpus.)
Hong Kong and US institutions complete the leadership set. The University of Hong Kong (29), Harvard University (22), the University of Washington (20), Johns Hopkins (14), the National Institutes of Health and the University of North Carolina at Chapel Hill round out the most-productive institutions.
Citation footprint tells a slightly different story. The Chinese Academy of Sciences carries an outsized citation footprint (driven by the foundational early-2020 virology cohort) relative to its work count — a structural signal that its smaller output sits disproportionately in the field's most-cited founding layer.

Producing institution	Works (floor)	Note
University of Oxford	49	Leads field output
University College London	31	UK production core
University of Hong Kong	29	Hong Kong leadership
University of Cambridge	22	UK production core
Harvard University	22	US leadership
Imperial College London	22	UK production core
London School of Hygiene & Tropical Medicine	21	UK production core
University of Washington	20	US leadership
King's College London	14	UK production core
Johns Hopkins University	14	US leadership
Chinese Academy of Sciences	—	Outsized citation footprint relative to work count

Figure — A compact, UK-and-China-anchored institutional core (field corpus · works by producing institution, with cumulative citation footprint). Oxford leads at 49 works ahead of a UK-weighted core (UCL, Cambridge, Imperial, the LSHTM, King's) and a Hong Kong and US set (Hong Kong, Harvard, Washington, Johns Hopkins); the Chinese Academy of Sciences carries an outsized citation footprint relative to its volume. Work counts are floors (capped pulls).

The structural read is a concentrated, UK-and-China-anchored institutional core producing the field's most-cited work. For a reader, this is the map of where to scout talent, where the existing collaboration density is highest, and which institutions to approach for partnership. One honest boundary travels with this layer: the field corpus carried no usable funder / grant metadata. The funding-flow read that would name which funders underwrite the field is therefore not recoverable from the returned records and is named as a boundary in the closing section, so the institutional read here is an output-and-citation concentration read, not a funding-flow read. And the work counts are floors (capped pulls), so the read is the relative concentration among institutions; a gap of one or two works between neighbouring institutions should not be read as a ranking.
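The regional weighting of the production core can be checked with a short aggregation over the table above. Two assumptions are made here for illustration: the region labels follow the table's Note column (with Oxford placed in the UK core, as the text does), and a co-authored work may be counted under more than one institution, so the sums are institution-level attributions and not distinct works. The Chinese Academy of Sciences is omitted because the table gives it no work count.

```python
from collections import Counter

# (institution, works floor, region) copied from the table above.
institution_works = [
    ("University of Oxford", 49, "UK"),
    ("University College London", 31, "UK"),
    ("University of Hong Kong", 29, "Hong Kong"),
    ("University of Cambridge", 22, "UK"),
    ("Harvard University", 22, "US"),
    ("Imperial College London", 22, "UK"),
    ("London School of Hygiene & Tropical Medicine", 21, "UK"),
    ("University of Washington", 20, "US"),
    ("King's College London", 14, "UK"),
    ("Johns Hopkins University", 14, "US"),
]

attributions_by_region = Counter()
for _name, works, region in institution_works:
    attributions_by_region[region] += works

listed_total = sum(attributions_by_region.values())

for region, works in attributions_by_region.most_common():
    print(f"{region}: {works} attributions ({works / listed_total:.0%} of listed)")
# UK: 159 attributions (65% of listed)
# US: 56 attributions (23% of listed)
# Hong Kong: 29 attributions (12% of listed)
```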

04 · Time evolution — how the field grew, and whether it is still active

(Confidence: High.) The fourth structural question is the field's trajectory over time — when it formed, how it has matured, and whether it is still active. By publication date the field has the canonical pandemic-research shape: an explosive 2020 founding burst that consolidates year over year. (Source: the scholarly-publication record, field corpus by publication year, plus an independent current-activity confirmation pull.)

Publication year	Works (floor)	Basis
2020	213	Founding burst
2021	158	Consolidation
2022	94	Consolidation
2026	200	Recency-sorted current-activity confirmation (different basis)

Figure — A 2020 founding burst that consolidates — and a field still publishing (field corpus · works by publication year · with a current-activity confirmation). The publication mass concentrates in a 2020 founding burst of 213 works that consolidates year over year (158 in 2021, 94 in 2022). The 2026 bar is on a different basis — a recency-sorted current-activity confirmation pull (200 works dated 2026) shown to confirm the field is still actively publishing, not to compare against the founding-era counts. Every count is a floor (capped pulls).

Two structural reads matter. First, the field is mature, not nascent: its foundational mass was laid down in 2020 and the field has been consolidating and building on it ever since. The year-over-year decline in corpus volume (213, then 158, then 94) is the normal maturation of a field whose canonical works are already written, not a sign the field has gone quiet. Second, the field is still actively publishing: the current-activity confirmation returns a full pull of 200 works dated 2026, and the recent biomedical-literature velocity (an independent second corpus) corroborates continued output in both the vaccine and the variant streams, with the variant stream in particular showing rising recent biomedical activity. The honest framing: this is a maturing, consolidating field that remains live, and the reader should read the consolidation as the field deepening around its founding cohort, not cooling. The highest-stakes claim here, that the field is still active, was independently re-verified against a second corpus, and it holds.
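The consolidation arc can be quantified as year-over-year change in the per-year counts from the table above. The 2026 figure is deliberately left out of this sketch because it comes from a recency-sorted confirmation pull on a different basis and is not comparable with the founding-era counts. All inputs are floors, so the percentages are indicative.

```python
# Per-year work counts copied from the table above (floors, same basis).
yearly_works = {2020: 213, 2021: 158, 2022: 94}

years = sorted(yearly_works)
year_over_year = {}
for previous, current in zip(years, years[1:]):
    change = (yearly_works[current] - yearly_works[previous]) / yearly_works[previous]
    year_over_year[current] = change

for year, change in year_over_year.items():
    print(f"{year}: {change:+.1%} vs {year - 1}")
# 2021: -25.8% vs 2020
# 2022: -40.5% vs 2021
```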

05 · What this means for you — where to place field attention

Anchor any read of the field on the early-2020 founding cohort — then look to the specialist clusters for the under-attended edge. The citation mass is concentrated in a small, coherent founding cohort (the clinical-characterization, virology and pivotal vaccine-efficacy works), so a newcomer's reading list and a funder's baseline both start there. But the crowded centre is not where marginal attention pays off — the specialist clusters (long-term effects, vaccine coverage / hesitancy, computational drug discovery, the mental-health literature) are the more legible places to find an under-served edge. Rationale: the concept-cluster and citation-magnitude vectors show one dominant, well-covered centre and a specialist tail; the leverage is at the edges, not the centre.
Scout talent and seed partnership at the compact UK-and-China-led institutional core. A small set of institutions — Oxford, UCL, Hong Kong, Imperial, Harvard, Cambridge, the LSHTM — produces the field's most-cited output, and the Chinese Academy of Sciences carries an outsized citation footprint relative to its volume. A funder or department head placing recruitment or collaboration attention should start with that core, and treat the high-citation-footprint-per-work institutions as the ones sitting closest to the field's founding layer. Rationale: the institutional-concentration vector names the compact production core and the citation-footprint outliers directly.
Read the field as maturing and live, not cooling — and target the still-rising sub-streams. The 2020 founding burst has consolidated, but the field is still publishing, and the recent biomedical velocity shows continued output in both sub-areas — the variant stream in particular shows rising recent activity. A reader deciding where new money or attention earns the most should weight the still-rising sub-streams over the saturated founding-era topics. Rationale: the time-evolution vector (re-verified against a second corpus) shows consolidation plus continued activity, with the variant biomedical stream rising most recently.
Commission the directed citation-graph layer to convert this structural map into a connectivity map. This Atlas maps where the clusters, the citation mass and the institutions sit; the natural next step is the directed who-cites-whom layer — which would identify the bridge researchers who connect the vaccine and variant sub-areas and the citation-co-occurrence linkage between clusters. That layer is named as an operationally-heavy enhancement in the closing section (the magnitude read is complete; only the directed edges are deferred). Commission it to turn the map of where things sit into a map of how they connect. Rationale: the directed citation graph is the highest-leverage disclosed enhancement — it is the single layer that would add the bridge-researcher and cluster-linkage reads.

Scope, confidence & what a deeper engagement adds

This Research Field Atlas maps four field vectors — concept-cluster structure, citation-magnitude ranking, institutional concentration, and time evolution — all directly observed against the returned scholarly-publication record, with the highest-stakes claim (the field's continued publication velocity) independently re-verified against a second corpus. The boundaries below are named with the specific reason and the work that closes each. They are diligence boundaries, not findings, and are never presented as such. Above all: this Atlas reports field structure, not scientific merit — it does not adjudicate whether any vaccine or variant claim is correct, rank therapies by efficacy, or make a medical recommendation.

The directed citation graph is a deferred, operationally-heavy enhancement. This Atlas reads citation magnitude (how heavily each work is cited) fully and inline. The directed who-cites-whom layer — the citation-co-occurrence linkage that shows which bodies of work cite together, the bridge-researcher identification (researchers cited across both the vaccine and variant sub-areas), and the reference-lineage walk — requires fanning a per-work citing-set traversal out across every foundational work, and that fan-out exceeded the analysis window across repeated reproductions. This is a near-term capability enhancement (a lighter or asynchronous traversal), not a data absence: the citation counts the structural read rests on are present, and the magnitude, cluster, institution and evolution reads are complete — the enhancement would add the directed edges (who connects to whom), not recover a missing count. We name no bridge researchers and draw no who-cites-whom edges, because that specific data was not captured. Closing it: a lighter-fan-out or asynchronous citation-graph traversal to add the directed edges and the bridge-researcher read.
The patent IP-translation read is a pending-credential layer. The read of which research has translated into protected IP — patent volume and assignees for COVID vaccine / variant inventions — requires a patent-record credential that was not configured for this run, so that layer returned nothing and is not reported. It is a pending-credential boundary, not a finding of "no patents." Closing it: provisioning the patent-record credential and re-running the IP-translation layer.
The corpus pulls were capped — every count is a floor. The concept, magnitude and institution pulls each returned the maximum number of items allowed, so the field-corpus size (595 distinct works), the cluster sizes, the institutional work counts and the per-year volumes are floors, not a complete census. The structural reads — one dominant cluster, an early-2020 citation mass, a compact institutional core, a founding-burst-then-consolidation arc — are unaffected and are the load-bearing reads; the exact volumes are bounded. Closing it: an uncapped corpus pull to quantify the full field and sharpen the exact counts.
The funder / grant metadata was absent — the funding-flow read is not recoverable here. The returned field corpus carried no usable funder / grant metadata, so the institutional read is an output-and-citation-concentration read, not a funding-flow read — this Atlas does not name which funders underwrite the field. Closing it: a funder-metadata-bearing corpus pull to add the funding-flow layer.
The citation read runs on a single scholarly corpus by construction. No independent second corpus indexes the citation graph at this grain, so the citation-magnitude and (when added) directed-graph reads are single-corpus by construction. The publication-velocity claim was cross-verified against an independent biomedical corpus; the citation-structure claims could not be. Closing it: a second citation-graph index to corroborate the citation-structure reads.
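To make the deferred bridge-researcher read in the first boundary above concrete: once directed edges exist, finding researchers cited across both sub-areas could reduce to a set-membership check. The sketch below is purely illustrative. The edge format, the `find_bridge_researchers` helper and the placeholder identifiers are all hypothetical; this Atlas captured no such edges and names no bridge researchers.

```python
from collections import defaultdict
from typing import Iterable


def find_bridge_researchers(
    citations: Iterable[tuple[str, str]],
    required: tuple[str, ...] = ("vaccine", "variant"),
) -> list[str]:
    """Return researchers cited from every sub-area in `required`.

    `citations` is an assumed edge format: (cited_researcher, citing_sub_area).
    """
    areas_citing = defaultdict(set)
    for researcher, sub_area in citations:
        areas_citing[researcher].add(sub_area)
    needed = set(required)
    return sorted(r for r, seen in areas_citing.items() if needed <= seen)


# Placeholder edges with made-up identifiers, not real data.
toy_edges = [
    ("researcher_A", "vaccine"),
    ("researcher_A", "variant"),
    ("researcher_B", "vaccine"),
    ("researcher_C", "variant"),
    ("researcher_C", "variant"),
]

print(find_bridge_researchers(toy_edges))  # ['researcher_A']
```

The check itself is cheap; as the boundary notes, the expensive step is collecting the edges, which means traversing the citing set of every foundational work.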

This is a field-structure read at the Research Field Atlas tier, built on the scholarly-publication record (concept clusters, citation magnitude, institutional structure, time evolution) and a biomedical-literature cross-corpus. The natural next step is a deeper engagement that turns this map into a connectivity map: (1) the directed citation-graph layer to identify the bridge researchers and the cluster linkage; (2) the patent IP-translation read once the patent-record credential is provisioned; and (3) an uncapped, funder-metadata-bearing corpus to quantify the field and add the funding-flow layer. To commission it, reach the ForIntel desk directly at forintel@foragentis.com.

This is a public sample of a ForIntel Research Field Atlas deliverable, published by Foragentis to demonstrate the method. It is an academic field map with no private buyer — every institution and concept named is public scholarly information. It reports field structure only — how the COVID-19 vaccines-and-variants research field is organized — and makes no scientific, clinical or efficacy claim about COVID vaccines or variants; any scientific interpretation is the reader's, with domain experts. Citation magnitude is a structural signal (where attention concentrates), not a statement about the correctness of any work. The directed citation graph (who-cites-whom edges, bridge researchers, reference lineage) is a deferred operationally-heavy enhancement, not a data absence — citation magnitude is fully present; only the directed edges are deferred, and no bridge researchers or who-cites-whom edges are named. The patent IP-translation read is a pending-credential layer and is not reported. The corpus pulls were capped, so all counts are floors. The citation read runs on a single scholarly corpus by construction; the publication-velocity claim was independently re-verified against a second corpus.