Speaker¶
-
class
convokit.model.speaker.
Speaker
(owner=None, id: str = None, utts=None, convos=None, meta: Optional[Dict] = None)¶ Represents a single speaker in a dataset.
- Parameters
id (str) – id of the speaker.
utts – dictionary of utterances by the speaker, where key is utterance id
convos – dictionary of conversations started by the speaker, where key is conversation id
meta (dict) – arbitrary dictionary of attributes associated with the speaker.
- Variables
id – id of the speaker.
meta – A dictionary-like view object providing read-write access to speaker-level metadata.
-
add_meta
(key: str, value) → None¶ Adds a key-value pair to the metadata of the corpus object :param key: name of metadata attribute :param value: value of metadata attribute :return: None
-
add_vector
(vector_name: str)¶ Logs in the Corpus component object’s internal vectors list that the component object has a vector row associated with it in the vector matrix named vector_name. Transformers that add vectors to the Corpus should use this to update the relevant component objects during the transform() step. :param vector_name: name of vector matrix :return: None
-
delete_vector
(vector_name: str)¶ Delete a vector associated with this Corpus component object. :param vector_name: :return: None
-
get_conversation
(cid: str)¶ Get the Conversation with the specified Conversation id
- Parameters
cid – The id of the Conversation
- Returns
A Conversation object
-
get_conversation_ids
(selector=<function Speaker.<lambda>>) → List[str]¶ - Returns
a List of the ids of Conversations started by the speaker
-
get_conversations_dataframe
(selector=<function Speaker.<lambda>>, exclude_meta: bool = False)¶ Get a DataFrame of the Conversations the Speaker has participated in, with fields and metadata attributes. Set an optional selector that filters for Conversations that should be included. Edits to the DataFrame do not change the corpus in any way.
- Parameters
exclude_meta – whether to exclude metadata
selector – a (lambda) function that takes a Conversation and returns True or False (i.e. include / exclude). By default, the selector includes all Conversations in the Corpus.
- Returns
a pandas DataFrame
-
get_utterance
(ut_id: str)¶ Get the Utterance with the specified Utterance id
- Parameters
ut_id – The id of the Utterance
- Returns
An Utterance object
-
get_utterance_ids
(selector=<function Speaker.<lambda>>) → List[str]¶ - Returns
a List of the ids of Utterances made by the speaker
-
get_utterances_dataframe
(selector=<function Speaker.<lambda>>, exclude_meta: bool = False)¶ Get a DataFrame of the Utterances made by the Speaker with fields and metadata attributes. Set an optional selector that filters for Utterances that should be included. Edits to the DataFrame do not change the corpus in any way.
- Parameters
exclude_meta – whether to exclude metadata
selector – a (lambda) function that takes a Utterance and returns True or False (i.e. include / exclude). By default, the selector includes all Utterances in the Corpus.
- Returns
a pandas DataFrame
-
get_vector
(vector_name: str, as_dataframe: bool = False, columns: Optional[List[str]] = None)¶ Get the vector stored as vector_name for this object. :param vector_name: name of vector :param as_dataframe: whether to return the vector as a dataframe (True) or in its raw array form (False). False
by default.
- Parameters
columns – optional list of named columns of the vector to include. All columns returned otherwise. This parameter is only used if as_dataframe is set to True
- Returns
a numpy / scipy array
-
iter_conversations
(selector=<function Speaker.<lambda>>)¶ - Returns
An iterator of the Conversations that the speaker has participated in
-
iter_utterances
(selector=<function Speaker.<lambda>>)¶ Get utterances made by the Speaker, with an optional selector that selects for Utterances that should be included.
- param selector
a (lambda) function that takes an Utterance and returns True or False (i.e. include / exclude). By default, the selector includes all Utterances in the Corpus.
- Returns
An iterator of the Utterances made by the speaker
-
print_speaker_stats
()¶ Helper function for printing the number of Utterances made and Conversations participated in by the Speaker.
- Returns
None (prints output)
-
retrieve_meta
(key: str)¶ Retrieves a value stored under the key of the metadata of corpus object :param key: name of metadata attribute :return: value