Agnieszka Kitkowska

20.9CRJul 7

Security and Privacy in Agentic AI: Grand Challenges and Future Directions

Adam Jenkins, Agnieszka Kitkowska, Caterina Maidhof et al.

We present key challenges and future research directions in the security and privacy of agentic AI, based on a horizon-scanning exercise that brought together thirty leading international experts from academia, industry, and government to engage in focused discussions and collaborative exercises on the emerging risks associated with the growing agency of AI.

6.6HCApr 21

Discerning Authorship in Online Health Communities: Experience, Trust, and Transparency Implications for Moderating AI

Yefim Shulman, Agnieszka Kitkowska, Mark Warner

For online health communities, community trust is paramount. Yet, advances in Large Language Models (LLMs) generating advice may erode this trust, especially if users cannot identify whether LLMs have been used. We investigate the feasibility of community-based detection of health advice authorship and how self-moderation of LLMs could help enhance advice utilization. In an online experiment, we evaluate people's ability to distinguish AI-generated from human-written advice across two health conditions, considering lived experience with a condition, AI-recognition training, and user attitudes towards transparency and trust around AI use. Our results indicate the need for transparency coupled with trust. We find little evidence of people's ability to discern advice authorship. However, we find a consistent effect of the health condition. Our qualitative findings identify unreliable signals, resulting in flawed heuristic evaluations of the advice. Our findings point to opportunities to improve the self-moderation of LLM-based AI and aid community-based AI moderation.

Agnieszka Kitkowska

2 Papers