Cornell Conversational Analysis Toolkit (ConvoKit) Documentation¶

This toolkit contains tools to extract conversational features and analyze social phenomena in conversations, using a single unified interface inspired by (and compatible with) scikit-learn. Several large conversational datasets are included together with scripts exemplifying the use of the toolkit on these datasets.

More information can be found at our website. The latest version is 4.1.2 (released June 26, 2026).