Welcome to the Data Supermarket

We democratize and automate Big Data and predictive analytics for small and medium-sized businesses.


You have your data in an Excel file or in a database.
We sign an NDA (non-disclosure agreement), collect the data and perform the following analyses:
– What products do customers like to buy?
– Which products are most commonly bought together?
– What are the 5 most purchased products in the last 3 months, and which customers do not have them yet?
We send you 3 reports with the results.
More information about our products
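
To make the three analyses above more concrete, here is a minimal sketch in plain Python. It is only an illustration, not our production pipeline: the sample purchases, the function names and the choice of 90 days as "the last 3 months" are all assumptions made for this example.

```python
from collections import Counter
from datetime import date, timedelta
from itertools import combinations

# Hypothetical sample data: (order_id, customer, product, purchase_date)
PURCHASES = [
    (1, "ana", "coffee", date(2024, 5, 2)),
    (1, "ana", "milk", date(2024, 5, 2)),
    (2, "ben", "coffee", date(2024, 5, 10)),
    (2, "ben", "milk", date(2024, 5, 10)),
    (2, "ben", "sugar", date(2024, 5, 10)),
    (3, "cho", "tea", date(2024, 1, 15)),
    (4, "cho", "coffee", date(2024, 6, 1)),
]


def most_purchased(purchases, top_n=5, since=None):
    """Return up to top_n products ranked by number of purchase lines.

    If `since` is given, only purchases on or after that date count.
    """
    counts = Counter(
        product
        for _, _, product, day in purchases
        if since is None or day >= since
    )
    return [product for product, _ in counts.most_common(top_n)]


def bought_together(purchases):
    """Count how often each pair of products appears in the same order."""
    baskets = {}
    for order_id, _, product, _ in purchases:
        baskets.setdefault(order_id, set()).add(product)
    pairs = Counter()
    for items in baskets.values():
        # Sort so that ("coffee", "milk") and ("milk", "coffee") are one pair.
        pairs.update(combinations(sorted(items), 2))
    return pairs


def customers_missing(purchases, products):
    """Map each product to the customers who have never bought it."""
    customers = {customer for _, customer, _, _ in purchases}
    owners = {}
    for _, customer, product, _ in purchases:
        owners.setdefault(product, set()).add(customer)
    return {p: sorted(customers - owners.get(p, set())) for p in products}


# Assumption: "the last 3 months" is approximated as the last 90 days.
today = date(2024, 6, 30)
recent_top = most_purchased(PURCHASES, top_n=5, since=today - timedelta(days=90))
pair_counts = bought_together(PURCHASES)
missing = customers_missing(PURCHASES, recent_top)
```

With real data, each of the three results would feed one of the reports: the product ranking, the pairs bought together, and the list of customers to whom each top product could still be offered.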

We offer consulting, training and demonstrations in Big Data and predictive analysis.

We work from Barcelona, Madrid, Portugal, London, France, Germany, Switzerland, Italy, United States, Canada, South America and India.
More information about us

The Data Supermarket – Big Data

A platform where customers can upload their data on their own (Excel, CSV, databases, etc.) and create and consume reports in an easy, automated way, with the following technologies:

Sources: Cloud, DWH, Data Lake, Database, Social (Twitter, Facebook, etc.) with Natural Language Processing (NLP; for example, Twitter and Spanish political groups): sentiment analysis, subjectivity, speaker recognition and real-time analysis

Data ingestion: Sqoop, Falcon, Kafka, Flume, Apex, Storm

Databases: MySQL, Oracle, MSSQL, Sybase, Informix, Teradata, Redshift, PDW (Parallel Data Warehouse Cluster)

Unstructured databases: Cassandra, MongoDB, HBase

Data access: Hive, Spark, Presto, Phoenix, Drill, Impala, Hue, Kudu

Compute engines: MapReduce, Tez, Spark

Data management: YARN, HDFS (Hadoop Distributed File System), MapR

Security: Kerberos, Knox

System management: Oozie, Ambari, ZooKeeper

ETL (extraction, transformation and load): Talend, Pentaho (PDI), PowerCenter (Informatica), Spark SQL, MS Integration Services

Reporting: MicroStrategy, QlikView/Sense, Tableau, SAP Business Objects, Cognos, Hyperion, JReports, MS Reporting Services, Power BI

Statistics and machine learning (ML): Mahout, SparkML, R, SciPy, NumPy, SAS, MATLAB, F#, neural networks with TensorFlow

Programming languages: VB, C#, F#, Java, Python, C++, including embedded versions

Data Lake: Lambda, Kappa, ad hoc (AWS, Google Cloud, Azure), as a platform, as a service.

Ecosystems: Cloudera, Hortonworks, Amazon Web Services (AWS), Google Cloud, Microsoft Azure


big data and predictive analysis