Demonstrated experience in machine learning, predictive modeling, or statistics/data mining using large amounts of structured, semi-structured, and unstructured data
Demonstrated experience using analytically oriented languages such as Python, R, or Julia
Demonstrated experience working with large, complex datasets using big data technologies and languages
Experience formulating, approaching, and solving problems on massive datasets
MS or PhD in Mathematics, Statistics, Physics, Computer Science, Engineering, or a related field
Experience with open-source packages for modeling (e.g., Torch, TensorFlow, scikit-learn, xgboost), visualization (e.g., matplotlib, ggplot, vega, d3.js), or data processing (e.g., Spark, Stanford CoreNLP, gensim)
Relational database and SQL skills
Experience with cloud infrastructure
Experience with software engineering tools and best practices, including version control, testing, and code review
Desired skills:
Ability to work as part of a team and to communicate effectively