HindiPersonalityNet: Personality Detection in Hindi Conversational Data using Deep Learning with Static Embedding

Kumar, Akshi; Jain, Dipika and Beniwal, Rohit. 2023. HindiPersonalityNet: Personality Detection in Hindi Conversational Data using Deep Learning with Static Embedding. ACM Transactions on Asian and Low-Resource Language Information Processing, ISSN 2375-4699 [Article] (In Press)

[img]
Preview
Text
Shakhsiyat_TALLIP_Main.pdf - Accepted Version

Download (556kB) | Preview

Abstract or Description

Personality detection along with other behavioural and cognitive assessment can essentially explain why people act the way they do and can be useful to various online applications such as recommender systems, job screening, matchmaking, and counselling. Additionally, psychometric NLP relying on textual cues and distinctive markers in writing style within conversational utterances reveal signs of individual personalities. This work demonstrates a text-based deep neural model, HindiPersonalityNet of classifying conversations into three personality categories {ambivert, extrovert, introvert} for detecting personality in Hindi conversational data. The model utilizes GRU with BioWordVec embeddings for text classification and is trained/tested on a novel dataset, शख्सियत (pronounced as Shakhsiyat) curated using dialogues from an Indian crime-thriller drama series, Aarya. The model achieves an F1-score of 0.701 and shows the potential for leveraging conversational data from various sources to understand and predict a person's personality traits. It exhibits the ability to capture semantic as well as long-distance dependencies in conversations and establishes the effectiveness of our dataset as a benchmark for personality detection in Hindi dialogue data. Further, a comprehensive comparison of various static and dynamic word embedding is done on our standardized dataset to ascertain the most suitable embedding method for personality detection.

Item Type:

Article

Identification Number (DOI):

https://doi.org/10.1145/3625228

Additional Information:

"© 2023 Copyright held by the owner/author(s). This is the author's version of the work. It is posted here for your personal use. Not for redistribution. The definitive Version of Record is available at, https://doi.org/10.1145/3625228."

Keywords:

Personality, low-resource, deep learning, word embeddings, NLP

Departments, Centres and Research Units:

Computing

Dates:

DateEvent
16 September 2023Accepted
29 September 2023Published Online

Item ID:

34138

Date Deposited:

02 Oct 2023 08:26

Last Modified:

03 Oct 2023 07:47

Peer Reviewed:

Yes, this version has been peer-reviewed.

URI:

https://research.gold.ac.uk/id/eprint/34138

View statistics for this item...

Edit Record Edit Record (login required)