Federal Open Market Committee (FOMC) Corpus =========================================== Transcripts of recurring meetings of the Federal Reserve’s Open Market Committee (FOMC), where important aspects of U.S. monetary policy are decided, covering the period 1977-2008. (108,504 conversational exchanges between 364 speakers of FOMC board members in 268 meetings). Distributed together with: `Talk it up or play it down? (Un)expected correlations between (de-)emphasis and recurrence of discussion points in consequential U.S. economic policy meetings `_. Chenhao Tan and Lillian Lee. Presented in Text As Data 2016. Please cite this paper when using FOMC corpus in your research. Dataset details --------------- Speaker-level information ^^^^^^^^^^^^^^^^^^^^^^^^^ Speakers in this dataset are FOMC members, indexed by their name as recorded in the transcripts. * id: name of the speaker * chair: (boolean) is speaker FOMC Chair * vice_chair: (boolean) is speaker FOMC Vice-Chair Utterance-level information ^^^^^^^^^^^^^^^^^^^^^^^^^^^ For each utterance, we provide: * id: index of the utterance (concatenating the meeting date with the utterance’s sequence position) * speaker: the speaker who authored the utterance * conversation_id: ID of meeting * reply_to: id of the sequentially prior utterance (None for the first utterance of a meeting) * text: textual content of the utterance * timestamp: calculated value based off the date of the meeting and the speech index Metadata for utterances include: * speech_index: index of utterance in the context of the conversation * parsed: parsed version of the utterance text, represented as a SpaCy Doc Conversational-level information ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ Conversations are indexed by a string representing the meeting date. Usage ----------- To download directly with ConvoKit: >>> from convokit import Corpus, download >>> corpus = Corpus(filename=download("fomc-corpus")) For some quick stats: >>> corpus.print_summary_stats() Number of Speakers: 364 Number of Utterances: 108504 Number of Conversations: 268 Additionally, if you want to process the original FOMC data into ConvoKit format you can use the following script `Converting FOMC Corpus to ConvoKit Format `_ Additional note --------------- The original dataset can be downloaded `here `_. Refer to the original README for more explanations on dataset construction. Contact ^^^^^^^ Please email any questions to: cristian@cs.cornell.edu (Cristian Danescu-Niculescu-Mizil).