data.mdwn 1.19 KiB
    rhatto committed
    [[!meta title="Data science, lean databases and formats"]]
    
    ## Basic
    
    * Ontologies and how to deal with lists.
    * Standards: schema.org, microdata, microformats, JSON, YAML, CSV, DOT, vCard.
    * Intelligence: how to easily search, index and produce outputs with structured data?
    * Samples: TODO and ChangeLog (see [yankee: Changelogs meet YAML](https://github.com/studio-b12/yankee)).
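
As a rough illustration of the "search, index and produce outputs" question above, here is a minimal Python sketch using only the standard library. The sample CSV, its column names (`id`, `task`, `tags`) and the helper functions are all hypothetical, made up for this example. It reads a small TODO list from CSV, builds a simple word index over one field, and emits the matching records as JSON. It is one possible approach, not a recommendation of any particular tool.

```python
import csv
import io
import json
from collections import defaultdict

# Hypothetical sample data: a tiny TODO list kept as CSV.
SAMPLE_CSV = """id,task,tags
1,Write changelog,docs release
2,Index GPX tracks,gps data
3,Publish release notes,docs
"""


def load_records(text):
    """Parse CSV text into a list of dicts, one per row."""
    return list(csv.DictReader(io.StringIO(text)))


def build_index(records, field):
    """Map each lowercase word in `field` to the ids of records containing it."""
    index = defaultdict(set)
    for record in records:
        for word in record[field].lower().split():
            index[word].add(record["id"])
    return index


def search(index, word):
    """Return the sorted ids of records whose indexed field contains `word`."""
    return sorted(index.get(word.lower(), set()))


records = load_records(SAMPLE_CSV)
tag_index = build_index(records, "tags")

# Look up every record tagged "docs" and serialize the matches as JSON.
docs_ids = search(tag_index, "docs")
docs_json = json.dumps([r for r in records if r["id"] in docs_ids], indent=2)
print(docs_json)
```

The same records could just as well be written out as YAML or vCard. The point is only that a flat list plus a small index already answers simple queries without a database.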
    
    ## Software
    
    * [mtail](https://packages.debian.org/stable/mtail).
    * [Scrapy | A Fast and Powerful Scraping and Web Crawling Framework](https://scrapy.org/).
    * [phantomjs in stretch](https://packages.debian.org/stable/phantomjs).
    * [wpull](https://wpull.readthedocs.io/en/master/usage.html).
    * [Darktable - virtual lighttable and darkroom for photographers](https://packages.debian.org/stable/darktable).
    * OsmAnd and GPX tracks.
    
    ## API, big data, etc.
    
    * https://stripe.com/blog/idempotency
    * https://botman.io
    * https://github.com/metabase/metabase
    * [Apache Drill](https://drill.apache.org/), [Presto](https://github.com/prestodb/presto), Hadoop, etc.
    
    * [Redash](https://redash.io/).
    
    * [TensorFlow](https://www.tensorflow.org/).
    * [Wikidata](https://www.wikidata.org).
    * [Swagger Specification](http://swagger.io/specification/).