Skip to content
Snippets Groups Projects
  1. Dec 21, 2019
  2. May 28, 2019
    • drebs's avatar
      Improve pt-br wordlist (#63) · 9c101bee
      drebs authored and ulif's avatar ulif committed
      Wordlist in pt-br was first introduced in 7743ed55. The differences to
      this one are:
      
        - 9-characters words are introduced.
        - suffixes removal is made after accounting for popularity.
        - less frequent words that differ only in the last character are
          removed.
      
      The current pt-br wordlist was generated as follows:
      
        1. Download a dump of portuguese Wikipedia pages, process all pages
           and determine the frequency of each word.
        2. Start from /usr/share/dict/brazilian and filter out:
             - words not matching /^[a-z]+$/,
             - words shorter than 4 characters, and
             - words longer than 9 characters.
        3. Sort remaining words using pt Wikipedia frequencies.
        4. Take the top 30K words (just because after filtering we still get
           roughly the amount we need).
        5. Filter out:
             - all words that are a suffix of any other word in the list.
             - less frequent words that differ only by the last character.
        6. Take the 7776 most frequent words.
      
      No further curation was made.
      9c101bee
  3. Apr 26, 2019
    • drebs's avatar
      Add pt-br wordlist (#60) · 7743ed55
      drebs authored and ulif's avatar ulif committed
      The wordlist was generated from 2 different sources of words:
      
        - The file /usr/share/dict/brazilian from Debian's wbrazilian package.
        - A dump of the pages of Wikipedia in portuguese.
      
      The final pt-br wordlist was generated as follows:
      
        1. Download a dump of portuguese Wikipedia pages, process all pages
           and determine the frequency of each word.
        2. Start from /usr/share/dict/brazilian and filter out:
             - words not matching /^[a-z]+$/,
             - words shorter than 4 characters, and
             - words longer than 8 characters.
        3. Remove all words that are a suffix of any other word in the list.
        4. Sort remaining words using pt Wikipedia frequencies.
        5. Take the 7776 most frequent words.
      
      No further curation was made.
      
      There are obvious drawbacks in this approach (eg: many very frequent
      words are left out because they are either too short or too long or
      contain accents or cedilla), but it was the best cost-benefit i could
      think about.
      7743ed55
  4. Dec 11, 2018
  5. Mar 23, 2018
    • ulif's avatar
      Tell, that we are allowed to use "diceware". · 7523692e
      ulif authored
      We received an official permission from Arnold G Reinhold to use the
      name 'diceware' under the following conditions:
      
      1) we use Pythons `system.random` as default source of randomness
      2) we continue to provide support for real dice
      3) we do not use the name for other tools than this commandline tool
      4) we include the trademark statement commited hereby.
      
      Many thanks go to Mr. Reinhold.
      
      Fixes #48.
      7523692e
  6. Feb 09, 2018
  7. Feb 07, 2018
  8. Jan 27, 2018
  9. Jan 01, 2018
  10. Aug 31, 2017
    • ulif's avatar
      Use docstring only to document function. · 9b7e8a14
      ulif authored
      The documenting comment (starting with ``#: ``) is neccessary for vars
      and constants only. All these comments are used by `Sphinx` to generate
      documentation.
      9b7e8a14
    • ulif's avatar
      Reorder function. · 2694974e
      ulif authored
      Put consts at top of file and functions below. The code is better
      readable this way.
      2694974e
  11. Aug 28, 2017
    • drebs's avatar
      use function to get wordlists path · 07e497e5
      drebs authored
      The path to wordlist files is hardcoded in the generated sphinx
      documentation and because it may differ depending on how the package is
      installed this creates different problems:
      
        - the path hardcoded in the online api documentation may differ from
          the one in each installation, leading to incorrect information
          provided to the user of the api.
      
        - packaging diceware documentation to specific operating systems may
          generate unreproducible packages.
      
      This commit replaces the variable WORDLISTS_DIR by a function called
      get_wordlists_dir() that delays the construction of the path instead of
      doing it during module loading.
      07e497e5
  12. Jun 06, 2017
  13. Jun 04, 2017
  14. May 31, 2017
    • ulif's avatar
      Make `infile` arg a string. · ad5c04f2
      ulif authored
      The `argparse.FileType` type is considered bad style. Cf.
      http://bugs.python.org/issue13824. With this FileType we cannot tell
      when to close a file after reading (which should be done for most
      regular wordlists) or not (in case of input from stdin).
      ad5c04f2
  15. May 29, 2017
  16. May 28, 2017
  17. May 26, 2017
  18. May 23, 2017
  19. Apr 24, 2017
    • ulif's avatar
      Activate logging. · de9ebfeb
      ulif authored
      Logging was available all the time but not yet activated. We now
      enable logging when the main script is executed. This makes the
      `verbose` commandline option functional.
      de9ebfeb
    • ulif's avatar
      Extend docstring. · 5ee8e4d4
      ulif authored
      5ee8e4d4
    • ulif's avatar
      Fix typos. · 1030a786
      ulif authored
      1030a786
  20. Mar 04, 2017
    • ulif's avatar
      Print a hint when discarding rolls. · 6ddfc483
      ulif authored
      When using `realdice` randomness source and picking values out of short
      sequences (sequences shorter than the number of dice sides), we might
      require additional dice throws (if a value rolled is not in range of
      sequence length). We now print a hint when this happens.
      6ddfc483
  21. Mar 03, 2017
  22. Feb 27, 2017
  23. Feb 18, 2017
  24. Jan 06, 2017
  25. Dec 17, 2016
  26. Aug 03, 2016
  27. Aug 02, 2016
  28. Jun 08, 2016
Loading