scripts/data-collection/twitter/README.md
Conversations on Twitter can be a useful source of training data for our model. Some Twitter threads contain prompts followed by helpful replies, which mirrors the way we want our model to respond to prompts.
These scripts are therefore intended to take Twitter data from a variety of sources, clean it into more useful formats, and combine the outputs into a unified training set that can be fed to our model as a conversation, or at least as a prompt with replies.
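
As a rough illustration of the "prompt with replies" shape described above, here is a minimal sketch in Python. It does not reflect the actual scripts in this directory: the tweet fields `id`, `text`, and `in_reply_to_id`, the helper names `clean_text` and `build_prompt_reply_records`, and the cleaning rules (dropping URLs and leading @-mentions) are all assumptions made for the example.

```python
import re
from collections import defaultdict

# Hypothetical cleaning rules: drop URLs and leading @-mentions.
URL_RE = re.compile(r"https?://\S+")
LEADING_MENTIONS_RE = re.compile(r"^(?:@\w+\s*)+")


def clean_text(text):
    """Remove URLs and leading mentions, then collapse whitespace."""
    text = URL_RE.sub("", text).strip()
    text = LEADING_MENTIONS_RE.sub("", text)
    return " ".join(text.split())


def build_prompt_reply_records(tweets):
    """Group direct replies under the tweet they answer.

    Each tweet is assumed to be a dict with the hypothetical keys
    "id", "text", and "in_reply_to_id" (None for a top-level tweet).
    """
    by_id = {tweet["id"]: tweet for tweet in tweets}
    replies = defaultdict(list)

    for tweet in tweets:
        parent_id = tweet.get("in_reply_to_id")
        # Skip top-level tweets and replies whose parent we do not have.
        if parent_id in by_id:
            cleaned = clean_text(tweet["text"])
            if cleaned:
                replies[parent_id].append(cleaned)

    records = []
    for parent_id, reply_texts in replies.items():
        prompt = clean_text(by_id[parent_id]["text"])
        if prompt:
            records.append({"prompt": prompt, "replies": reply_texts})
    return records
```

Records from different sources that share this shape could then be concatenated into one training file, for example one JSON object per line. Reconstructing full multi-turn conversations would need the reply chain to be followed recursively, which this sketch leaves out.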
Note: Based on issue #126