15.071 | Spring 2017 | Graduate

The Analytics Edge

5.4 Predictive Coding: Bringing Text Analytics to the Courtroom (Recitation)

Video 2: The Data

In this recitation, we’ll be using the dataset energy_bids (CSV - 2.0MB). Please download and save this dataset to your computer so that you can follow along. This data comes from the 2010 TREC Legal Track.

An R script file with all of the commands we will be using in this recitation can be downloaded here: Unit5_Recitation (R).
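Once the file is downloaded, a first step might look like the sketch below. It is not taken from the recitation script. It assumes the dataset was saved as `energy_bids.csv` in the R working directory, and `load_bids` is a hypothetical helper name introduced here for illustration.

```r
# Hypothetical helper (not from the recitation script): load the dataset
# if it is present, otherwise report that it still needs to be downloaded.
# Assumption: the file was saved as "energy_bids.csv" in the working directory.
load_bids <- function(path = "energy_bids.csv") {
  if (!file.exists(path)) {
    message("File not found: ", path, " (download it first, or adjust the path)")
    return(NULL)
  }
  # stringsAsFactors = FALSE keeps text columns as character vectors
  # rather than converting them to factors.
  read.csv(path, stringsAsFactors = FALSE)
}

emails <- load_bids()
if (!is.null(emails)) {
  str(emails)  # inspect the column names and types before going further
}
```

Keeping text columns as character vectors is the usual choice when the data will later be processed as text. If the file lives elsewhere, pass its full path to `load_bids`.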
