  1. Feb 14, 2022
  2. Feb 04, 2022
      Remove discriminating terms (fixes #85) · 76846688
      ulif authored
      Some wordlists contained vulgar, discriminatory, and otherwise
      offensive language. We start to fix that by removing or replacing the
      terms found by @kmille. There are certainly more words to remove, and
      we must face the problem that it is not easy to generate properly
      welcoming wordlists automatically. Any further hints or help are welcome.
      Bump copyright year. · aeb758d7
      ulif authored
      Also cover py3.9/3.10 in `tox`. · 26c55f1f
      ulif authored
      Additional fixes for python3.4 `tox` runs.
  3. Sep 09, 2021
  4. Mar 27, 2021
      Support Python 3.8 officially. · ff5ec702
      ulif authored
      Retract support for pypy2, Python 2.6, and Python 3.3. Less officially,
      all retracted versions still work, but testing them automatically becomes
      more and more cumbersome (support is ending in setuptools, pytest, pip,
      virtualenv, ...).
      
      We also support Python 3.9, although `tox` refuses to set up a
      corresponding env.
      Update man page. · 9ef46905
      ulif authored
      Show the new option (multiple wordlists) and add information about the new lists.
      Mention @drebs as copyright holder of pt-BR list. · ae53c926
      ulif authored
  5. Mar 24, 2021
      Update README. · ad66e31d
      ulif authored
      Reflect changes in output of ``--help``.
  6. Mar 16, 2021
  7. Mar 15, 2021
  8. Sep 25, 2020
  9. Aug 24, 2020
  10. Aug 19, 2020
  11. Jul 26, 2020
  12. Apr 24, 2020
  13. Apr 20, 2020
  14. Apr 19, 2020
  15. Apr 16, 2020
  16. Jan 17, 2020
  17. Dec 21, 2019
  18. May 28, 2019
      Improve pt-br wordlist (#63) · 9c101bee
      drebs authored and ulif committed
      The pt-br wordlist was first introduced in 7743ed55. The differences
      from that version are:
      
        - 9-character words are introduced.
        - suffix removal is done after accounting for popularity.
        - less frequent words that differ only in the last character are
          removed.
      
      The current pt-br wordlist was generated as follows:
      
        1. Download a dump of Portuguese Wikipedia pages, process all pages
           and determine the frequency of each word.
        2. Start from /usr/share/dict/brazilian and filter out:
             - words not matching /^[a-z]+$/,
             - words shorter than 4 characters, and
             - words longer than 9 characters.
        3. Sort remaining words using pt Wikipedia frequencies.
        4. Take the top 30K words (just because after filtering we still get
           roughly the amount we need).
        5. Filter out:
             - all words that are a suffix of any other word in the list.
             - less frequent words that differ only by the last character.
        6. Take the 7776 most frequent words.
      
      No further curation was done.
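
The generation steps above could be sketched roughly as follows. This is not the script that was actually used for the commit; it is a minimal illustration under stated assumptions. The function names `count_frequencies` and `build_wordlist`, the tie-breaking rule (alphabetical), and the choice to compute suffixes over the whole top-30K list before removing anything are all hypothetical. Reading the Wikipedia dump and `/usr/share/dict/brazilian` is left out; the functions take plain Python iterables instead.

```python
import re
from collections import Counter

# Letters only (Unicode-aware), so accented words are counted as whole tokens.
TOKEN_RE = re.compile(r"[^\W\d_]+")


def count_frequencies(texts):
    """Step 1: count how often each lowercased word occurs in the given texts."""
    counts = Counter()
    for text in texts:
        counts.update(TOKEN_RE.findall(text.lower()))
    return counts


def build_wordlist(dictionary_words, frequencies, top_n=30000, size=7776):
    """Steps 2-6: filter, rank, and prune dictionary words (illustrative only)."""
    # Step 2: keep words of 4..9 characters consisting only of a-z.
    candidates = {
        w for w in dictionary_words
        if re.fullmatch(r"[a-z]+", w) and 4 <= len(w) <= 9
    }
    # Step 3: most frequent first; ties broken alphabetically (an assumption).
    ranked = sorted(candidates, key=lambda w: (-frequencies.get(w, 0), w))
    # Step 4: take the top N words.
    ranked = ranked[:top_n]
    # Step 5a: drop words that are a proper suffix of another word in the list.
    suffixes = {w[i:] for w in ranked for i in range(1, len(w))}
    ranked = [w for w in ranked if w not in suffixes]
    # Step 5b: of words differing only in the last character, keep the most
    # frequent one (the list is already ordered by frequency).
    seen_stems = set()
    kept = []
    for w in ranked:
        stem = w[:-1]
        if stem not in seen_stems:
            seen_stems.add(stem)
            kept.append(w)
    # Step 6: take the most frequent remaining words.
    return kept[:size]


if __name__ == "__main__":
    corpus = ["casa casa casa gato gato gata durante durante ante Casa"]
    dictionary = ["casa", "gato", "gata", "ante", "durante", "ação", "sol"]
    print(build_wordlist(dictionary, count_frequencies(corpus)))
```

In the toy run, `ante` is dropped as a suffix of `durante`, `gata` is dropped because the more frequent `gato` differs only in the last character, and `ação` and `sol` fail the character and length filters.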
  19. Apr 26, 2019
      Update year. · 6a1a7623
      ulif authored
      Next release won't be in 2018.