Priori Data | Big Data Engineer | Berlin, Germany | ONSITE | www.prioridata.com
ABOUT PRIORI DATA
Priori Data is a Berlin-based app market intelligence company.
We help our partners and clients make data-driven decisions around the app economy by providing download, revenue and usage estimates for every relevant app and game, as well as tools for keyword optimization (ASO).
Various stakeholders in the app economy rely on our products, including top developers, large brands, leading venture capital firms and consultancies.
OUR TECHNOLOGY
* BigQuery. We use BQ to store and analyse massive datasets without the need to manage any infrastructure. It’s our data lake.
* Kubernetes. We currently use Google Kubernetes Engine for resource-intensive tasks, such as generating machine learning predictions for our data models.
* Apache Airflow is our data processing pipeline orchestration tool.
* Self-managed Celery & Sidekiq for job/task queue management.
* Monitoring and observability with Grafana and InfluxDB.
* Several other products from the Google Cloud stack: Cloud SQL, Pub/Sub, StackDriver, Cloud Storage, etc.
* Python and Ruby as main programming languages.
WHAT WE’RE LOOKING FOR
* You will help set up and maintain large-scale, reliable distributed data collection systems
* You will closely collaborate with Data Scientists, Developers and Product teams
* You have some knowledge of designing and implementing distributed systems
* You know your way around Python or Ruby as your programming language
* You have knowledge of Linux platforms and scripting capabilities (e.g., Bash, Ansible)
* You have experience with cloud platforms like Google Cloud Platform/GCP or AWS
* Ideally, you have experience with Docker and container orchestration tools
* You are able to communicate your findings clearly to both tech and non-tech audiences. (We are an English-speaking office so German is not required.)
Email: jobs@prioridata.com