Sample Datasets
StarTree Cloud includes sample datasets that you can use to evaluate Apache Pinot if you don't have data of your own.
Select Dataset
- Simple Website
- Website with Anomalies
For the Simple Website dataset, click Use Sample Data and then select Simple Website under the "Which data do you want to use?" heading:
Simple Website Dataset
For the Website with Anomalies dataset, click Use Sample Data and then select Website with Anomalies under the "Which data do you want to use?" heading:
Anomalies Dataset
Now click NEXT.
Review
You'll now see the review and submit screen, where you can review the dataset before importing it.
Review Data Source
If you want to choose a different sample dataset, click on the PREV button to go back to the previous screen.
Once you're ready to create the data source, click on the FINISH button. You'll then see the following screen:
Data Source Created
Query Data Source
- Simple Website
- Website with Anomalies
To look at the imported data for the Simple Website dataset, click the Query Console link, which opens the Pinot Data Explorer. Click the name of the table that you created (this guide uses the name simple), then click RUN QUERY to run a basic query against the data source:
Query simple Data Source
Below are some other queries that you can run against this dataset:
select sum(views)
from simple
Total views
select count(*)
from simple
where hoursSinceEpoch > 770500
Recent events
To look at the imported data for the Website with Anomalies dataset, click the Query Console link, which opens the Pinot Data Explorer. Click the name of the table that you created (this guide uses the name anomalies), then click RUN QUERY to run a basic query against the data source:
Query anomalies Data Source
Below are some other queries that you can run against this dataset:
select browser, sum(views) AS totalViews
from anomalies
group by browser
order by totalViews DESC
Top browsers
select country, count(*) AS count
from anomalies
where clicks > 0
group by country
order by count DESC
Most popular country
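If you'd rather run these queries from a script than from the Query Console, the sketch below shows one possible approach using only the Python standard library. It posts the SQL to the Pinot broker's /query/sql endpoint and reads the resultTable field of the JSON response. The broker address, the bearer-token authentication, and the helper names (build_query_request, rows_as_dicts, run_query) are assumptions made for illustration and are not taken from this guide, so check the connection details of your own environment before relying on it.

```python
import json
import urllib.request

# Hypothetical placeholder: replace with the broker URL of your own environment.
BROKER_URL = "http://localhost:8099"


def build_query_request(sql, broker_url=BROKER_URL, token=None):
    """Build a POST request for the Pinot broker's /query/sql endpoint."""
    body = json.dumps({"sql": sql}).encode("utf-8")
    headers = {"Content-Type": "application/json"}
    if token is not None:
        # Assumption: the environment accepts a bearer token for authentication.
        headers["Authorization"] = f"Bearer {token}"
    return urllib.request.Request(
        broker_url.rstrip("/") + "/query/sql",
        data=body,
        headers=headers,
        method="POST",
    )


def rows_as_dicts(response):
    """Convert a broker response's resultTable into a list of {column: value} dicts."""
    table = response["resultTable"]
    columns = table["dataSchema"]["columnNames"]
    return [dict(zip(columns, row)) for row in table["rows"]]


def run_query(sql, broker_url=BROKER_URL, token=None):
    """Send the query and return its rows. Requires network access to a broker."""
    request = build_query_request(sql, broker_url, token)
    with urllib.request.urlopen(request) as reply:
        return rows_as_dicts(json.load(reply))
```

With a reachable broker, calling run_query("select browser, sum(views) AS totalViews from anomalies group by browser order by totalViews DESC") would return one dictionary per browser, keyed by the column names browser and totalViews.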