
HOUSE_OVERSIGHT_017032


a. Conflict resolution involves deciding whether a query name associated with multiple records can unambiguously refer to a single one of them.

b. Wikipedia. Conflict resolution for Wikipedia records is carried out on the basis of the main article word count and traffic statistics. A conflict is resolved as follows:
   i. Find the cumulative word count of the articles in conflict.
   ii. Find the cumulative number of views resulting from the traffic to the articles in conflict.
   iii. For every record in the conflict, find the fraction of words and the fraction of views attributable to this record by dividing by the cumulative counts.
   iv. Does a record have the largest fraction of both words written and page views?
   v. Does this record have above 66% of either words written or page views?
   vi. If so, the conflicted query name can be considered sufficiently specific to the record with these properties.

c. Encyclopedia Britannica. Conflict resolution for Encyclopedia Britannica records is carried out on the basis of the quantity of information snippets present in the dataset.
   i. Find the cumulative number of information snippets related to the records in conflict.
   ii. For every record in the conflict, find the fraction of information snippets by dividing by the cumulative count.
   iii. If a record has greater than 66% of the cumulative total, the query name in conflict is considered to refer to this record.

l1.7.A.9 Identify the most relevant name used to refer to an individual.

So far, we have obtained, for all individuals in both our databases, a set of names by which they can plausibly be mentioned. From this set, we wish to identify the best candidate and use its word frequency to observe the fame of the person at hand. This optimal name is identified on the basis of the amplitude of the word frequency, the potential ambiguities that arise from name homonymy, and the quality of the word frequency time series. Examples are shown in Fig. S11 and
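
The two conflict-resolution procedures described on this page (items b and c) can be sketched in a few lines of Python. This is a minimal illustration and not the authors' implementation: the function names, the dictionary-based record layout, and the handling of empty totals are assumptions made here, and step v is read as "above 66% of either words or views", which is one interpretation of the source wording.

```python
from typing import Dict, Optional, Tuple

THRESHOLD = 0.66  # the 66% cutoff stated in the text


def resolve_wikipedia(records: Dict[str, Tuple[int, int]]) -> Optional[str]:
    """Hypothetical helper: records maps record id -> (word_count, page_views).

    Returns the id the query name is taken to refer to, or None if the
    conflict cannot be resolved.
    """
    # Steps i and ii: cumulative word count and cumulative page views.
    total_words = sum(words for words, _ in records.values())
    total_views = sum(views for _, views in records.values())
    if total_words == 0 or total_views == 0:
        return None  # assumption: no data means no resolution

    # Step iii: per-record fractions of words and of views.
    fractions = {
        rid: (words / total_words, views / total_views)
        for rid, (words, views) in records.items()
    }

    # Step iv: the same record must lead on both words and views.
    top_words = max(fractions, key=lambda rid: fractions[rid][0])
    top_views = max(fractions, key=lambda rid: fractions[rid][1])
    if top_words != top_views:
        return None

    # Step v (as interpreted here): above 66% on at least one measure.
    word_fraction, view_fraction = fractions[top_words]
    if word_fraction > THRESHOLD or view_fraction > THRESHOLD:
        return top_words  # step vi: name is sufficiently specific
    return None


def resolve_britannica(snippets: Dict[str, int]) -> Optional[str]:
    """Hypothetical helper: snippets maps record id -> snippet count."""
    total = sum(snippets.values())  # step i
    if total == 0:
        return None  # assumption: no data means no resolution
    for rid, count in snippets.items():
        if count / total > THRESHOLD:  # steps ii and iii
            return rid
    return None


if __name__ == "__main__":
    # Made-up counts, for illustration only.
    print(resolve_wikipedia({"A": (9000, 80000), "B": (1000, 20000)}))
    print(resolve_britannica({"A": 5, "B": 5}))
```

Because the threshold exceeds 50%, at most one record can pass it, so the order in which the Britannica records are scanned does not affect the result.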
