Head of Data Science at Colearn

Apply for this opportunity
Jakarta, Indonesia
Full-time

About this opportunity

About Colearn

We are Indonesia’s fastest growing education technology company on a mission to bring Indonesian students to the top 50% of global PISA rankings by 2025. We see Data Science playing a huge role in making our mission a reality.

Position Summary

  • Responsible for developing machine learning solutions in Natural Language Processing (NLP),document classification, Named Entity Recognition (NER), topic modelling, document summarization, computational linguistics, advanced and semantic information search, extraction, induction, classification and exploration.
  • Create ML models for Advanced OCR and Cognitive Data Extraction capability as well as its execution.
  • Develop, maintain and deploy ML & NLP Pipeline and models
  • Create NLP/ML models with high performance, quality, and stability.

Requirements


  • Prior experience of building Data Science teams ground up and building for scale
  • At least 2 years experience in designing and developing enterprise-scale NLP solutions in two or more of: Named Entity Recognition, Document Classification, Document Summarization, Topic Modelling,Dialog Systems, Sentiment Analysis, OCR text processing
  • Excellent knowledge and demonstrable experience in using open source NLP packages such as and not limited to NLTK, Word2Vec, SpaCy, Gensim, Standford CoreNLP.
  • Strong knowledge and working experience with a strong understanding of NLP/ML & algorithms and models (GLMs, SVM, PCA, NB, Clustering, DTs) and their underlying computational and probabilistic statistics.
  • At least 3 years programming experience in one or more of the following: Python, R, Scala.
  • Experience in setting up supervised & unsupervised learning ML/NLP models including data cleaning, data analytics, feature creation, model selection & ensemble methods, performance metrics &visualization
  • 1 to 2 years experience in ML/NLP development pipelines of large data sets, both structured &unstructured
  • 1 to 2 years experience building Machine Learning & NLP solutions over open source platforms such as SciKit-Learn,Tensorflow, SparkML, Torch, Caffe, H2O

Non-Technical Skills


  • Setting up and guiding a Data Science org.
  • Highly motivated, proactive and a self-starter; strong sense of ownership & ability to create and execute
  • Critical thinker; ability to analyze problems and identify issues and provide solutions
  • Analytical abilities & great problem solving
  • Highly organized. Effectively prioritizes and balances multiple efforts in a fast-paced environment
  • Good communication and presentation skills
Apply for this opportunity