Publications

2026

Rajwal, Swati, Avinash Kumar Pandey, Ziyuan Zhang, Yankai Chen, Michael X Liu, Sudeshna Das, Hannah Rogers, Abeed Sarker, and Yunyu Xiao. (2026) 2026. “Applications of Natural Language Processing and Large Language Models for Social Determinants of Health: Systematic Review.”. Journal of Medical Internet Research 28: e83793. https://doi.org/10.2196/83793.

BACKGROUND: Social determinants of health (SDOH) are the social, economic, and environmental conditions that influence health outcomes. SDOH information is often embedded in unstructured text, such as notes in electronic health records and social media posts. Advances in natural language processing (NLP), including emergent large language models (LLMs), offer opportunities to extract, analyze, and interpret SDOH expressions from free text for inclusion in downstream analyses. Existing literature on NLP applications for SDOH is dispersed across disciplines and characterized by methodological heterogeneity and variability in study quality and scope, complicating synthesis and cross-study comparison.

OBJECTIVE: This study aimed to examine the use of NLP, including LLMs, in SDOH research, and highlight gaps and future research directions.

METHODS: We conducted a systematic review following PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) guidelines, searching 7 major databases for publications between 2014 and November 2025. We included journal and conference proceedings papers that applied NLP methods to identify, classify, extract, or predict SDOH from text. Three reviewers independently screened studies and extracted data; conflicts were resolved by two senior reviewers. We abstracted study metadata, dataset characteristics, NLP approaches, SDOH domains addressed, and NLP performance metrics. We also conducted risk-of-bias analyses and identified influential studies based on relative citation counts.

RESULTS: 142 studies met the inclusion criteria. Nearly two-thirds (89/142, 62.7%) were published between 2023 and 2025, reflecting rapid recent growth. Most studies relied on electronic health records (93/142, 65.5%) and private datasets (81/142, 57.0%), while only 20.4% (29/142) used publicly available data. Commonly studied SDOH domains were housing instability (72/142, 50.7%), employment (65/142, 45.8%), and financial conditions (63/142, 44.4%); structural factors, such as immigration status (5/142, 3.5%), were rarely examined. Of studies that reported evaluation metrics, most focused on classification (26/83, 31.32%) or extraction (38/83, 45.7%), and used cross-sectional designs. Reported model performances were typically strong, with median F1-scores ranging roughly from 0.75 to 0.85 across model categories. Only 49 studies shared code, and fewer than half clearly described model interpretability or reproducibility practices. LLMs (including encoder-decoder models) appeared in 19.7% (28/142) of studies, highlighting emerging interest but also raising new concerns around transparency and governance.

CONCLUSIONS: This review provides a timely synthesis of NLP and LLM applications across the SDOH research spectrum, addressing an important gap in a topic receiving increasing research attention. By comparing task formulations, data sources, and performance patterns, the review clarifies the research readiness of current approaches and reveals critical gaps. Our findings advance the field by highlighting the absence of a unified SDOH framework, uneven availability of public benchmarks, and limited evaluation of real-world deployment. Addressing these gaps through transparent, inclusive dataset development and implementation-focused evaluation is essential for translating NLP advances into equitable, real-world health impact.

2025

Walker, Andrew, Jerik Leung, Aishwarya Alagappan, Swati Rajwal, Sahithi Lakamana, Tricia Park, Nathan Le, et al. (2025) 2025. “Centering Patient Voices in Lupus Pain: A Biopsychosocial Analysis of Reddit Narratives Using Large Language Models.”. Arthritis Care & Research. https://doi.org/10.1002/acr.25687.

OBJECTIVE: Patients with chronic illness share their experiences in online communities, generating rich data on pain management. This study applied natural language processing methods, including large language models, to Reddit discussions from lupus communities to characterize multidimensional pain experiences framed in the biopsychosocial model.

METHODS: We extracted Reddit posts from the r/Lupus and r/LupusSupport subreddits posted from June 9, 2010 through December 31, 2023. Pain-related posts were identified using a clinically informed pain lexicon. Topic modeling was used to identify thematic patterns, which were then compared to structured summaries generated by an LLM instruction fine-tuned using the biopsychosocial model of pain. Two reviewers conducted content analysis of the LLM-generated summaries, evaluating thematic accuracy and coverage.

RESULTS: Data from Reddit included 31,785 posts, from 10,857 authors. We identified common pain complaints, management strategies, and sociocultural, affective, and nociplastic dimensions of pain. Instruction fine-tuned LLMs produced structured summaries with an average thematic accuracy score of 3.1 out of 4 (kappa = .09) and content coverage score of 2.9 out of 4 (kappa = .38). Sociocultural features presented in 123 posts (33.8%), including peer support and validation (n=106) and provider interactions or access issues (n=35). Nociplastic pain presented in 205 posts (56.3%).

CONCLUSION: NLP methods can be used to extract rich, multidimensional insights about pain experiences from online communities focused on lupus. These approaches highlight the psychological, social, and cultural facets of pain that may be underrepresented in clinical settings, supporting more patient-centered approaches to care in rheumatology.