FAQ

What is DataDistillr?

DataDistillr (DDR) is a unified platform that enables data-savvy professionals to connect to virtually any data source and to query and join the data via SQL, without requiring ETL (Extract, Transform & Load) support. The data may be relational or non-relational; structured, semi-structured, or unstructured; stored in flat files or exposed through an API. It may reside on a local machine, in the cloud, or in a hybrid environment.

How can I join data while the Join Data button is not yet functional?

We expect that feature to be available in the very near future. In the interim, you can manually write a SQL join statement directly within the query window. A sample query is provided below for your reference (this one joins two movie files):

SELECT dis.title, CONVERT_FROMJSON(tmdb.production_companies)['name'] AS studio
FROM s3.root.`/kaggle/media/tmdb_movies.csvh` AS tmdb
INNER JOIN (SELECT *
            FROM s3.root.`/kaggle/media/DisneyMovies.csvh`) AS dis
ON tmdb.original_title = dis.title
ORDER BY dis.title, studio
LIMIT 1000
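
If you also want to keep titles from one file that have no match in the other, a LEFT JOIN is one option. The sketch below is illustrative only: it reuses the file paths and column names from the sample above, and it assumes that the query window accepts standard LEFT JOIN syntax and backtick-quoted file paths.

```sql
-- Illustrative sketch: keep every Disney title, matched or not.
-- Paths and column names are taken from the sample query above;
-- LEFT JOIN support and backtick quoting are assumptions here.
SELECT dis.title, tmdb.original_title
FROM s3.root.`/kaggle/media/DisneyMovies.csvh` AS dis
LEFT JOIN s3.root.`/kaggle/media/tmdb_movies.csvh` AS tmdb
ON tmdb.original_title = dis.title
ORDER BY dis.title
LIMIT 1000
```

Rows where tmdb.original_title comes back NULL are Disney titles with no match in the TMDB file.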

How can I connect to a database or a cloud storage data source while the configuration buttons are not yet functional?

We expect that feature to be available in the very near future. In the interim, you can send us a request at [email protected] and we can configure it for you on the "back end" given sufficient authorization from the owner of the data.

How can I connect to an API while the configuration buttons are not yet functional?

We expect that feature to be available in the very near future. In the interim, you can manually write a SQL query that hits a publicly available API. A sample query is provided below for your reference (this one queries showtimes for a movie in a given city):

SELECT *
FROM api.movietimes
WHERE movie = 'Saving Private Ryan' AND city = 'Las Vegas, NV'