TCS Grep Sharp Club! - A Example Search - The Epstein Files


Shown on this page is a pictoral representation of a TCSGrepSharp search of the Epstein files recently released by the United States Department of Justice pursuant to legislation. The Epstein Files were downloaded from the government website https://www.justice.gov/epstein to a Thompson Custom Software server solid state hard drive. For those not familiar, there are different types of data available broadly divided into document and non-document types. Generally speaking, document type data is amenable to electronic searching and analysis whereas non-document type data like photographs and video is much less amenable to such searching and analysis. Further, document type data can be broadly divided in human readable and binary. In the vast majority of cases, the document type data in the case of the "Epstein files" is NOT stored in human readable form. Rather the document data is stored in Abobe's "Portable Document Format" aka PDF (.pdf) files. Only a very sophisticated application like TCSGrepSharp can decode .pdf files rapidly and accurately and subsequently search this data for specified text or apply carefully crafted regular expressions!

A Windows Explorer window showing the raw .zip files downloaded from the US Department of Justice Website. The zipped size of these nine files is 13 Gigabytes = 13,000,000,000 bytes!
A screenshot of TCSGrepSharp in action. Shown here is the "search in progress" page. This page shows the text or regular expression being searched for - the text "Trump" in this case. The file type or types being searched - *.pdf files here. Recent files that have been searched or are in the progress of being searched. The folder or folders being searched along with whether or not child folders are to be searched. Finally, it shows the user settings for all the configurable parameters of TCSGrepSharp.
This screenshot shows the last page of the search run specification wizard. This page shows all of the user's selections and provides an opportunity to go back and change them before starting the search. The red arrows emphasize the ability to select files to be searched based on their file type - .pdf in this case.
Showing the search specification - the most popular search of this data.
The Epstein files search for "Trump" completed. About 12,000 .pdf files in about 3 and 1/2 minutes. The match count and matches to follow...
745 lines with "Trump" in them were found in 131 files! More to follow...
17 "Trump" matches for in the highlighted file and displayed in the right panel. More to come...
Thompson Custom Software's custom editor to display text from binary files like PDF's also highlights all matches if desired.