Data Scientist

Job Summary

We are looking for a Data Scientist who will help us discover the information hidden in vast amounts of data and make smarter decisions to deliver even better products. Your primary focus will be on applying data mining techniques, doing statistical analysis, and building high-quality prediction systems integrated with our products. You will need to rapidly prototype various algorithmic implementations and test their efficacy using appropriate experimental design and hypothesis validation.

Experience with multiple programming languages is preferred, as is familiarity with disciplines such as machine learning, artificial intelligence, conceptual modelling, statistical analysis, predictive modelling, and hypothesis testing.

Essential Duties

  • Selecting features, then building and optimizing classifiers using machine learning techniques

  • Data mining using state-of-the-art methods

  • Extending the company’s data with third-party sources of information when needed

  • Enhancing data collection procedures to include information that is relevant for building analytic systems

  • Processing, cleansing, and verifying the integrity of data used for analysis

  • Doing ad-hoc analysis and presenting results in a clear manner

  • Creating automated anomaly detection systems and continuously tracking their performance

Preferred Education and Experience

  • Experience: Minimum of 10 years delivering world-class data science outcomes. You solve complex analytical problems using quantitative approaches with your unique blend of analytical, mathematical, and technical skills.

  • Education: MS or PhD in Applied Statistics, Artificial Intelligence, Computer Science, Data Mining, Machine Learning, Physics, Statistics, or a related quantitative discipline.

  • A deep understanding of statistical and predictive modeling concepts, machine-learning approaches, clustering and classification techniques, and recommendation and optimization algorithms.

  • A passion for asking and answering questions about large datasets, and the ability to communicate that passion to product managers and engineers.

  • Excellent understanding of machine learning techniques and algorithms, such as k-NN, Naive Bayes, SVM, and Decision Forests.

  • Experience with common data science toolkits, such as R, Weka, NumPy, and MATLAB. Excellence in at least one of these is highly desirable.

  • Experience with data visualisation tools, such as D3.js and ggplot2

  • Proficiency in using query languages such as SQL

  • Experience with NoSQL databases, such as Cassandra and Elasticsearch

  • Experience with streaming data platforms such as Kafka

  • Experience with distributed data processing frameworks such as Hadoop and Spark

  • Experience with AWS cloud architecture, including Lambda, EC2, and Kinesis, is highly desirable.

  • Applied statistics skills, such as distributions, statistical testing, regression, etc.

  • Ability to communicate findings orally and visually.

