We have extensive experience with the latest Big Data technologies. Our work covers the ingestion of unstructured, semi-structured, and structured data from different types of source systems, in various forms and at various frequencies, and its storage in distributed, fault-tolerant file systems. We also handle the curation and exploitation of data on Big Data platforms.
We are experts in:
- Databricks
- Apache Spark
- Delta Lake
- Hadoop
- Hive
- Amazon EMR (Elastic MapReduce)
- Amazon S3
- Azure Data Lake Storage Gen2
- PySpark, Jupyter, Zeppelin