Edu-ConvoKit: An Open-Source Library for Education Conversation Data
Edu-ConvoKit gives researchers and practitioners in education a tool for handling conversation data more easily, though the contribution is incremental because it builds on existing library concepts.
The authors tackled the scarcity of resources for analyzing education conversation data by introducing Edu-ConvoKit, an open-source library for pre-processing, annotation, and analysis, which is pip-installable and includes comprehensive documentation and demo resources.
We introduce Edu-ConvoKit, an open-source library designed to handle pre-processing, annotation, and analysis of conversation data in education. Resources for analyzing education conversation data are scarce, making the research challenging to perform and therefore hard to access. We address these challenges with Edu-ConvoKit. Edu-ConvoKit is open-source (https://github.com/stanfordnlp/edu-convokit), pip-installable (https://pypi.org/project/edu-convokit/), and comes with comprehensive documentation (https://edu-convokit.readthedocs.io/en/latest/). Our demo video is available at https://youtu.be/zdcI839vAko?si=h9qlnl76ucSuXb8-. Additional resources, such as Colab applications of Edu-ConvoKit to three diverse education datasets and a repository of Edu-ConvoKit-related papers, can be found in our GitHub repository.
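To make the three stages named in the abstract concrete, the following is a minimal sketch of a pre-process, annotate, analyze pipeline on a toy transcript. It uses only the Python standard library and deliberately does not call Edu-ConvoKit: the function names (`preprocess`, `annotate`, `analyze`), the field names, the question heuristic, and the transcript itself are all hypothetical and chosen for illustration. The library's actual API is described in its documentation.

```python
import re
from collections import defaultdict

# Invented toy transcript of (speaker, utterance) pairs; not from any real dataset.
transcript = [
    ("Ms. Lee", "What is 3 times 4?"),
    ("Jordan", "Um, I think it is 12."),
    ("Ms. Lee", "Good. How did you get 12?"),
    ("Jordan", "I added 4 three times."),
]


def preprocess(rows):
    """Pre-processing (hypothetical): replace speaker names with stable anonymous IDs."""
    ids = {}
    cleaned = []
    for speaker, text in rows:
        # Assign SPEAKER_1, SPEAKER_2, ... in order of first appearance.
        ids.setdefault(speaker, f"SPEAKER_{len(ids) + 1}")
        cleaned.append({"speaker": ids[speaker], "text": text.strip()})
    return cleaned


def annotate(rows):
    """Annotation (hypothetical): add a word count and a crude question flag per utterance."""
    for row in rows:
        row["talk_time"] = len(re.findall(r"\w+", row["text"]))
        # Assumption: an utterance ending in "?" counts as a question.
        row["is_question"] = row["text"].endswith("?")
    return rows


def analyze(rows):
    """Analysis (hypothetical): aggregate the annotations per speaker."""
    summary = defaultdict(lambda: {"talk_time": 0, "questions": 0})
    for row in rows:
        stats = summary[row["speaker"]]
        stats["talk_time"] += row["talk_time"]
        stats["questions"] += int(row["is_question"])
    return dict(summary)


summary = analyze(annotate(preprocess(transcript)))
for speaker, stats in summary.items():
    print(speaker, stats)
```

Keeping each stage as a separate function that takes and returns a list of utterance records mirrors the separation the abstract describes, so any one stage can be swapped out without touching the others.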