AICurious Logo

What is: Meena?

SourceTowards a Human-like Open-Domain Chatbot
Year2000
Data SourceCC BY-SA - https://paperswithcode.com

Meena is a multi-turn open-domain chatbot trained end-to-end on data mined and filtered from public domain social media conversations. This 2.6B parameter neural network is simply trained to minimize perplexity of the next token. A seq2seq model is used with the Evolved Transformer as the main architecture. The model is trained on multi-turn conversations where the input sequence is all turns of the context and the output sequence is the response.