We have indexed over 30 million messages from the publicly available /pol/ message boards on 4chan and 8chan, and compiled them into a model of toxic language use.
The trained word embeddings (±0.4GB) are available and may be useful for further study on toxic discourse or to boost hate speech detection systems.
Report available upon request.