Abstract
Tenant-landlord relationships exhibit a power asymmetry where landlords’ power to evict the tenants at a low-cost results in their dominating status in such relationships. Tenant concerns are thus often unspoken, unresolved, or ignored and this could lead to blatant conflicts as suppressed tenant concerns accumulate. Modern machine learning methods and Large Language Models (LLM) have demonstrated immense abilities to perform language tasks. In this study, we incorporate Latent Dirichlet Allocation with GPT-4 to classify Reddit post data scraped from the subreddit r/Tenant, aiming to unveil trends in tenant concerns while exploring the adoption of LLMs and machine learning methods in social science research. We find that tenant concerns in topics like fee dispute and utility issues are consistently dominant in all four states analyzed while each state has other common tenant concerns special to itself. Moreover, we discover temporal trends in tenant concerns that provide important implications regarding the impact of the pandemic and the Eviction Moratorium.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Data Availability Statement
The data that support the findings of this study are not openly available. However, the same data can be accessed directly from Reddit through official or third-party APIs.
Notes
Reddit Data API Update: Changes to Pushshift Access https://www.reddit.com/r/modnews/comments/134tjpe/reddit_data_api_update_changes_to_pushshift_access/.
OpenAI API Reference Create chat completion https://platform.openai.com/docs/api-reference/chat/create.
References
Gomory, H., Massey, D.S., Hendrickson, J.R., & Desmond, M. (2023). When It’s cheap to file an eviction case, tenants pay the price. evictionlab.org/tenants-pay-for-cheap-evictions (Accessed July 2023). https://evictionlab.org/tenants-pay-for-cheap-evictions/.
Kapolka, A. G. (2022). Landlord-Tenant Relationships and the COVID-19 Pandemic: A Qualitative Exploration of Landlord Power and the Eviction Moratorium. Yale University.
Chisholm, E., Howden-Chapman, P., & Fougere, G. (2020). Tenants’ responses to substandard housing: Hidden and invisible power and the failure of rental housing regulation. Housing, Theory and Society, 37(2), 139–161. https://doi.org/10.1080/14036096.2018.1538019
Si, C., Gan, Z., Yang, Z., Wang, S., Wang, J., Boyd-Graber, J., & Wang, L. (2023). Prompting GPT-3 to be Reliable. arXiv preprint. Available at arxiv.org/abs/2210.09150 (Accessed July 2023).
Palatucci, M. (2009). Zero-shot learning with semantic output codes. In: Neural Information Processing Systems.
Ziems, C., Held, W., Shaikh, O., Chen, J., Zhang, Z., & Yang, D. (2023). Can large language models transform computational social science? arXiv preprint. Available at arxiv.org/abs/2305.03514 (Accessed July 2023).
Chiu, K.L., & Alexander, R. (2021). Detecting hate speech with GPT-3. CoRR abs/2103.12407. https://arxiv.org/abs/2103.12407.
Egami, N., Jacobs-Harukawa, M., Stewart, B.M., & Wei, H. (2023). Using large language model annotations for valid downstream statistical inference in social science: Design-based semi-supervised learning. arXiv preprint. Available at arxiv.org/abs/2306.04746 (Accessed July 2023).
Guo, F. (2023). GPT agents in game theory experiments. arXiv preprint. Available at https://doi.org/10.48550/arXiv.2305.05516 (Accessed July 2023).
Paoli, S. D. (2023). Can large language models emulate an inductive thematic analysis of semi-structured interviews? An exploration and provocation on the limits of the approach and the model. arXiv preprint. Available at arxiv.org/abs/2305.13014 (Accessed July 2023).
U.S. Census Bureau. (n.d.). Methodology: American Housing Survey (AHS). U.S. Department of Commerce. Retrieved January 3rd, 2025, from https://www.census.gov/programs-surveys/ahs/about/methodology.html
Edward Coulson, N., Le, T., & Shen, L. (2020). Tenant rights, eviction, and rent affordability. SSRN Electronic Journal. https://doi.org/10.2139/ssrn.3641859
Chapple, K., Tim T., & Miriam Z. (2021). Urban displacement project website. www.urbandisplacement.org (Accessed April 2023). https://www.urbandisplacement.org.
Desmond, M. (2016). Evicted: Poverty and Profit in the American City. Crown.
Griswold, M., Holliday, S.B., Sizemore, A., Ren, C., Baker, L., Howell, K., Osoba, O., Williams, J., Ward, J.M., & Hunter, S.B. (2023). An evaluation of crime-free housing policies. RAND.
Holl, M., Van Den Dries, L., & Wolf, J. R. L. M. (2016). Interventions to prevent tenant evictions: A systematic review. Health & Social Care in the Community, 24(5), 532–546.
Thomas, T., Ramiller, A., Ren, C., & Toomet, O. (2024). Toward a national eviction data collection strategy using natural language processing. Cityscape, 26(1), 241–260.
Seron, C., Frankel, M., Van Ryzin, G., & Kovath, J. (2001). The impact of legal counsel on outcomes for poor tenants in New York City’s housing court: Results of a randomized experiment. Law and Society Review., 35, 419–434.
Merritt, B., & Farnworth, M. D. (2021). State landlord-tenant policy and eviction rates in majority-minority neighborhoods. Housing Policy Debate, 31(3–5), 562–581. https://doi.org/10.1080/10511482.2020.1828989
Bates, L. (2020). Stability, equity, and dignity: reporting and reflecting on Oregon tenant experiences during the COVID-19 pandemic. Homelessness Research & Action Collaborative Publications and Presentations.
Byrne, M., & Sassi, J. (2022). Making and unmaking home in the COVID-19 pandemic: A qualitative research study of the experience of private rental tenants in Ireland. International Journal of Housing Policy, 23(3), 1–20. https://doi.org/10.1080/19491247.2022.2037176
Bang, Y., Cahyawijaya, S., Lee, N., Dai, W., Su, D., Wilie, B., Lovenia, H., Ji, Z., Yu, T., Chung, W. Do, Q.V., Xu, Y., & Fung, P. (2023). A multitask, multilingual, multimodal evaluation of ChatGPT on reasoning, hallucination, and interactivity. arXiv preprint. Available at arxiv.org/abs/2206.07682 (Accessed July 2023).
Wei, J., Tay, Y., Bommasani, R., Raffel, C., Zoph, B., Borgeaud, S., Yogatama, D., Bosma, M., Zhou, D., Metzler, D., & Chi, E.H., Tatsunori H., Oriol V., Percy L., Jeff D., & William F. (2022). Emergent Abilities of Large Language Models. arXiv preprint. Available at arxiv.org/abs/2206.07682 (Accessed July 2023).
Amaya, A., Bach, R., Keusch, F., & Kreuter, F. (2021). New data sources in social science research: Things to know before working with Reddit data. Social Science Computer Review, 39(5), 943–960.
Jamnik, M. R., & Lane, D. J. (2017). The use of Reddit as an inexpensive source for high-quality data. Practical Assessment, Research, and Evaluation, 22(1), 5.
Kairam, S., Bernstein, M.S., Bruckman, A.S., Chancellor, S., Chandrasekharan, E., De Choudhury, M., Fiesler, C., Li, H., Proferes, N., Horta Ribeiro, M., Smith, C.E., & Galen C. W. (2024). Community-driven models for research on social platforms. In: Companion Publication of the 2024 Conference on Computer-Supported Cooperative Work and Social Computing (CSCW Companion ‘24). Association for Computing Machinery, pp. 684–688. https://doi.org/10.1145/3678884.3687141
Hintz, E. A., & Betts, T. (2022). Reddit in communication research: Current status, future directions and best practices. Annals of the International Communication Association, 46(2), 116–133.
De Choudhury, M., & De, S. (2014). Mental health discourse on Reddit: Self-disclosure, social support, and anonymity. In: Proceedings of the International AAAI Conference on Web and Social Media. International AAAI Conference on Web and Social Media, (pp. 71–80).
McDowall, L., Antoniak, M., & Mimno, D. (2023). Sensemaking about contraceptive methods across online platforms. arXiv preprint. Available at arxiv:2301.09295 (Accessed July 2023).
Pleasants, E., Ryan, J. H., Ren, C., Prata, N., Gomez, A. M., & Marshall, C. (2023). Exploring Language Used in Posts on r/birthcontrol: Case Study Using Data From Reddit Posts and Natural Language Processing to Advance Contraception Research. Journal of medical Internet research, 25, e46342. https://doi.org/10.2196/46342
Sang, E.F., De Meulder, F. (2003). Introduction to the CoNLL-2003 shared task: Language-independent named entity recognition. In: Proceedings of the Seventh Conference on Natural Language Learning at HLT-NAACL 2003, 142–147. https://www.aclweb.org/anthology/W03-0419.
Blei, D. M., Ng, A. Y., & Jordan, M. I. (2003). Latent Dirichlet allocation. Journal of Machine Learning Research, 3(January), 993–1022.
Hong, L., & Davison, B.D. (2010). Empirical study of topic modeling in twitter. SOMA ‘10, pp. 80–88. Association for Computing Machinery. https://doi.org/10.1145/1964858.1964870. ISBN 9781450302173.
Perez, E., Kiela, D., & Cho, K. (2021). True few-shot learning with language models. In: M. Ranzato, A. Beygelzimer, Y. Dauphin, P. S. Liang, J. Wortman Vaughan (Eds.), Advances in Neural Information Processing Systems, Curran Associates, Inc, (pp. 11054–11070).
Deng, X., Bashlovkina, V., Han, F., Baumgartner, S., & Bendersky, M. (2022). What Do LLMs know about financial markets? A case study on reddit market sentiment analysis. arXiv preprint. Available at arxiv.org/abs/2212.11311 (Accessed July 2023).
Ma, S., Sun, X., Lin, J., & Ren, X. (2018). A hierarchical end-to-end model for jointly improving text summarization and sentiment classification. arXiv preprint arXiv:1805.01089.
Tang, D., Qin, B., & Liu, T. (2015). Document modeling with gated recurrent neural network for sentiment classification. In: Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, pp. 1422–1432.
Mu, Y., Wu, B.P., Thorne, W., Robinson, A., Aletras, N., Scarton, C., Bontcheva, K., & Song, X. (2023). Navigating prompt complexity for zero-shot classification: a study of large language models in computational social science. arXiv preprint. Available at arxiv.org/abs/2305.14310 (Accessed July 2023).
Herlocker, J. L., Konstan, J. A., Terveen, L. G., & Riedl, J. T. (2004). Evaluating collaborative filtering recommender systems. ACM Transactions on Information Systems, 22(1), 5–53. https://doi.org/10.1145/963770.963772
Li, Z., Ren, C., Li, X., & Pardos, Z.A.. (2021). Learning skill equivalencies across platform taxonomies. In: LAK21: 11th International Learning Analytics and Knowledge Conference, pp. 354–363.
Fleiss, J. L. (1971). Measuring nominal scale agreement among many raters. Psychological Bulletin, 76(5), 378–382. https://doi.org/10.1037/h0031619
Achiam, J., Adler, S., Agarwal, S., Ahmad, L., Akkaya, I., Aleman, F.L., Almeida, D., Altenschmidt, J., Altman, S., & Anadkat, S. et al. (2023). GPT-4 Technical Report. arXiv preprint arXiv:2303.08774.
Tsai, J., Huang, M., Blosnich, J. R., & Elbogen, E. B. (2022). Evictions and tenant-landlord relationships during the 2020–2021 eviction moratorium in the US. American Journal of Community Psychology, 70(1–2), 117–126.
Torresin, S., Albatici, R., Aletta, F., Babich, F., Oberman, T., Stawinoga, A. E., & Kang, J. (2022). Indoor soundscapes at home during the COVID-19 lockdown in London–Part II: A structural equation model for comfort, content, and well-being. Applied Acoustics, 185, 108379. https://doi.org/10.1016/j.apacoust.2021.108379
Dümen, A. Ş, & Şaher, K. (2020). Noise annoyance during COVID-19 lockdown: A research of public opinion before and during the pandemic. The Journal of the Acoustical Society of America, 148(6), 3489–3496. https://doi.org/10.1121/10.0002667
Sumagaysay, L. (2020). California eviction Moratorium Is ‘a real nightmare’ for renters to understand—here’s what you need to know. www.marketwatch.com (Accessed July 2023). https://www.marketwatch.com.
Yan, L., Sha, L., Zhao, L., Li, Y., Martinez‐Maldonado, R., Chen, G., Li, X., Jin, Y., & Gašević, D. (2023). Practical and ethical challenges of large language models in education: A systematic literature review. arXiv preprint. Available at https://arxiv.org/abs/2303.13379 (Accessed July 2023).
Wang, R.E., & Demszky, D. (2023). Is ChatGPT a good teacher coach? Measuring Zero-Shot performance for scoring and providing actionable insights on classroom instruction.
Hess, C., & Chasins, S.E. (2022). Informing housing policy through web automation: Lessons for designing programming tools for domain experts. In: Extended Abstracts of the 2022 CHI Conference on Human Factors in Computing Systems. CHI EA ‘22. New York, NY, USA: Association for Computing Machinery. https://doi.org/10.1145/3491101.3503575. ISBN 9781450391566.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no relevant financial or non-financial interests to disclose. Furthermore, there are no conflicts of interest related to the content of this article.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Chen, X., Ren, C. & Thomas, T.A. Evaluating tenant-landlord tensions using generative AI on online tenant forums. J Comput Soc Sc 8, 50 (2025). https://doi.org/10.1007/s42001-025-00378-8
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s42001-025-00378-8