Big Data

  1. SQL
    • Hive
    • Apache Phoenix on HBase
    • Spark SQL
    • Presto
    • IBM Big SQL
  2. HBase http://www.uml.org.cn/bigdata/201804131.asp
  3. NiFi https://dzone.com/articles/apache-spark-and-apache-nifi-integration-part-2-of
  4. Knox Gateway

Machine Learning

  1. https://github.com/Shujian2015/data-challenge-2
  2. Machine Learning
  3. Deep Learning Framework
    • TensorFlow/Keras
    • PyTorch (fastai for a high-level API)
    • MXNet/Gluon
    • BigDL for Apache Spark https://github.com/intel-analytics/BigDL

Coding Interview

  1. https://github.com/jwasham/coding-interview-university
  2. https://github.com/CyC2018/CS-Notes

Personal Blogs

  1. https://veronikach.com/
  2. https://colah.github.io/
  3. https://www.liaoxuefeng.com/ (Chinese)
  4. http://www.ruanyifeng.com/blog/ (Chinese)

Company Blogs

  1. Netflix
  2. Dropbox
  3. Alibaba

Others

  1. Internet Trends Report 2018 https://www.kleinerperkins.com/

MOOC

  1. http://deeplearning.cs.cmu.edu/
  2. (Coursera) Functional Programming in Scala

Books

  1. High Performance Spark Note
  2. Designing Data-Intensive Applications Note
  3. Streaming Systems

Coding Practice