Symptom Detection with Text Message Log Distributions for Holistic Depression and Anxiety Screening
Abstract
Major Depressive Disorder (MDD) and Generalized Anxiety Disorder (GAD) are both heterogeneous in their clinical presentations, manifesting with unique symptom profiles. Despite this, prior digital phenotype research has primarily focused on disorder-level detection rather than symptom-level detection. In this research, we predict the existence of individual symptoms of MDD and GAD with SMS log metadata, and ensemble these symptom-level classifiers to screen for depression and anxiety, thus accounting for disorder heterogeneity. Further, we collect an additional dataset of retrospectively harvested SMS logs to augment an existing dataset collected after COVID-19 altered communication patterns, and propose two new types of distribution features: consecutive messages and conversation ratio. Our symptom-level detectors achieved a balanced accuracy of 0.7 in 13 of the 16 MDD and GAD symptoms, with reply latency distribution features achieving a balanced accuracy of 0.78 when detecting anxiety symptom trouble relaxing. When combined into disorder-level ensembles, these symptom-level detectors achieved a balanced accuracy of 0.76 for depression screening and 0.73 for anxiety screening, with tree boosting methods demonstrating particular efficacy. Accounting for disorder heterogeneity, our research provides insight into the value of SMS logs for the assessment of depression and anxiety diagnostic criteria.
BibTeX
@article{10.1145/3643554, author = {Tlachac, ML and Heinz, Michael and Reisch, Miranda and Ogden, Samuel S.}, title = {Symptom Detection with Text Message Log Distributions for Holistic Depression and Anxiety Screening}, year = {2024}, issue_date = {March 2024}, publisher = {Association for Computing Machinery}, address = {New York, NY, USA}, volume = {8}, number = {1}, url = {https://doi.org/10.1145/3643554}, doi = {10.1145/3643554}, abstract = {Major Depressive Disorder (MDD) and Generalized Anxiety Disorder (GAD) are both heterogeneous in their clinical presentations, manifesting with unique symptom profiles. Despite this, prior digital phenotype research has primarily focused on disorder-level detection rather than symptom-level detection. In this research, we predict the existence of individual symptoms of MDD and GAD with SMS log metadata, and ensemble these symptom-level classifiers to screen for depression and anxiety, thus accounting for disorder heterogeneity. Further, we collect an additional dataset of retrospectively harvested SMS logs to augment an existing dataset collected after COVID-19 altered communication patterns, and propose two new types of distribution features: consecutive messages and conversation ratio. Our symptom-level detectors achieved a balanced accuracy of 0.7 in 13 of the 16 MDD and GAD symptoms, with reply latency distribution features achieving a balanced accuracy of 0.78 when detecting anxiety symptom trouble relaxing. When combined into disorder-level ensembles, these symptom-level detectors achieved a balanced accuracy of 0.76 for depression screening and 0.73 for anxiety screening, with tree boosting methods demonstrating particular efficacy. Accounting for disorder heterogeneity, our research provides insight into the value of SMS logs for the assessment of depression and anxiety diagnostic criteria.}, journal = {Proc. ACM Interact. Mob. Wearable Ubiquitous Technol.}, month = {mar}, articleno = {19}, numpages = {28}, keywords = {AdaBoost, XGBoost, digital phenotype, mental health assessment, mobile health} }