Data mining =
Hindsight
Hindsight is learning from past mistakes in order to build expertise. It involves critical reflection to integrate knowledge gained from historical outcomes with other knowledge sources — creating a foundation of evidence that no individual human reader can match.
In ScriptBook's context, hindsight means qualitative data mining: computational methods to gather, process, and analyze large quantities of textual data (film scripts), extracting and exploiting the patterns, structures, and signals that correlate with box office performance.
Our database contains over 100.000 scripts with known theatrical outcomes — spanning films released between 1970 and 2024. This is the largest screenplay training dataset in the industry, and the foundation of everything ScriptBook predicts.