home

epstein-data
Research ▼
🔍 SearchFull-text document search 🤖 Ask AIAI research assistant 🔎 Evidence MapFBI serial resolution 📷 Reverse Image SearchCLIP + face across 614K images 🧑 Find Face BETASearch 29K faces by photo 💻 Run Your OwnDownload & search locally
Explore ▼
📚 Full Text Corpus1.39M docs, 2.77M pages 🌎 Global Heatmap145 countries mentioned 📈 Coverage MapWhat's here 🌌 AtlasSemantic map · 1.29M docs ⚖ Cases53 federal & state cases · per-case briefings 🎤 DepositionsTranscribed audio & video 💬 Hear from the SurvivorsSurvivors in their own words 📖 Cover to Cover-Up24-hour public reading, synced to the video ✉ Wolff–Epstein Emails2,009 messages · 2009–2019
📷 Images92K analyzed photographs 🔍 Multi-DB SearchSearch all databases individually 🗃 All Databases14 searchable databases
Entities Reports
News ▼
📰 NewsCoverage & reporting ⚖ Justice MonitorArrests, charges, lawsuits, firings
Source ▼
🏛 DOJ ProductionOfficial EFTA disclosures 📜 EFTA Law TextPublic Law 119-38 📁 Source Data (GitHub)Open source databases
🌐 Community ResourcesCurated external projects ✉ ContactGeneral · privacy · DMCA · press
❤️ Donate 🎧 Podcast

Research

🔍 Search Documents 🤖 Ask AI 🔎 Evidence Map 📷 Reverse Image Search 🧑 Find Face BETA 💻 Run Your Own Investigator

Explore

📚 Full Text Corpus 🌎 Global Heatmap 📈 Coverage Map 🌌 Atlas ⚖ Cases 🎤 Depositions 💬 Hear from the Survivors 📖 Cover to Cover-Up ✉ Wolff–Epstein Emails 📷 Images 🔍 Multi-DB Search 🗃 All Databases

Entities

👥 Entity Directory

Reports

Browse All Reports 📰 News ⚖ Justice Monitor

Source

🏛 DOJ Production 📜 EFTA Law 📁 Source Data (GitHub) 🌐 Community Resources ✉ Contact
🎧 Podcast & Newsletter ❤️ Donate Privacy Policy

HOUSE_OVERSIGHT_016997

← Prev Next →
Loading document…

(“3.14159”) and typos (“excesss”). An n-gram is sequence of enormous growth: the addition of ~8500 words/year has 1-grams, such as the phrases “stock market” (a 2-gram) and increased the size of the language by over 70% during the last “the United States of America” (a 5-gram). We restricted n to fifty years (Fig. 2A). 5, and limited our study to n-grams occurring at least 40 times Notably, we found more words than appear in any in the corpus. dictionary. For instance, the 2002 Webster’s Third New Usage frequency is computed by dividing the number of International Dictionary [W3], which keeps track of the instances of the n-gram in a given year by the total number of contemporary American lexicon, lists approximately 348,000 words in the corpus in that year. For instance, in 1861, the 1- single-word wordforms (/0); the American Heritage gram “slavery” appeared in the corpus 21,460 times, on Dictionary of the English Language, Fourth Edition (AHD4) 11,687 pages of 1,208 books. The corpus contains lists 116,161 (//). (Both contain additional multi-word 386,434,758 words from 1861; thus the frequency is 5.5x10°. entries.) Part of this gap is because dictionaries often exclude “slavery” peaked during the civil war (early 1860s) and then proper nouns and compound words (“whalewatching”). Even again during the civil rights movement (1955-1968) (Fig. 1B) accounting for these factors, we found many undocumented In contrast, we compare the frequency of “the Great War” words, such as “aridification” (the process by which a to the frequencies of “World War I” and “World War II.” “the geographic region becomes dry), “slenthem” (a musical Great War” peaks between 1915 and 1941. But although its instrument), and, appropriately, the word “deletable.” frequency drops thereafter, interest in the underlying events This gap between dictionaries and the lexicon results from = had not disappeared; instead, they are referred to as “World a balance that every dictionary must strike: it mu

Suggest a category
Misclassified? Pick a better fit.
Community Notes
▸ People Mentioned
▸ Interest Level
Routine Notable Significant
▸ Dates Mentioned
▸ Related Topics
▸ Places & Organizations
▸ Transcription Correction
▸ Research Notes 0
No notes yet.
Related documents
Source Data Investigation Reports DOJ EFTA CC BY-NC-SA 4.0 Contact
Independent research project. Not affiliated with the U.S. Department of Justice, FBI, any government agency, or Anthropic. All analytical text on this site is AI-generated (Claude, Anthropic) and iteratively fact-checked against source documents, but may contain errors. Verify all claims against linked EFTA sources before citing.
Powered by Datasette  ·  ❤️ Buy me a coffee

You are leaving epstein-data.com

You are being redirected to an external website not operated by this project. We are not responsible for the content or privacy practices of external sites.

Powered by Datasette