n-gram + discord

This is a CLI program that builds a n-gram language model on a character level for both users from their discord DM's.

How to use it?

C++20 and Tyrrrz/DiscordChatExporter are required.

Clone this repository git clone git@github.com:kiletic/ngram-discord.git
Enter the repository folder and export your discord DM's into a data.txt file using DiscordChatExporter CLI tool with the following command:
dotnet DiscordChatExporter.Cli.dll export --token "INSERT_YOUR_TOKEN_HERE" -c INSERT_CHANNEL_ID -f "PlainText" -o data.txt

Note: Make sure you enter your discord token into INSERT_YOUR_TOKEN_HERE and the channel id into INSERT_CHANNEL_ID. How to get the token and the channel_id?

Compile main.cpp with: c++ -std=c++20 -O3 main.cpp -o main
Run the program: ./main INSERT_USER1_NAME INSERT_USER2_NAME

Note: Inserted names should be actual discord usernames as they appear in data.txt, and not display names. It doesn't matter in which order you write them in.

How does it work?

It parses the data.txt file for messages from both users, optionally skipping messages that contain embeddings or attachments (this can be changed by setting filter_attachments and filter_embeds to false when calling the filter_data function). Then, it builds a n-gram language model on a character level from the messages (to read more about the math check wiki). Finally, to use the model just call the generate_sentence function. Feel free to play with the value of n to get different results.

Name		Name	Last commit message	Last commit date
Latest commit History 22 Commits
.editorconfig		.editorconfig
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
main.cpp		main.cpp

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

n-gram + discord

How to use it?

How does it work?

About

Uh oh!

Releases 1

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

n-gram + discord

How to use it?

How does it work?

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages