Intro to SQL++ - Detroit Tech Watch - June 2019

SQL++ FOR BIG DATA
Same Language, More Power
Matthew D. Groves

2
SQL, for the win
https://insights.stackoverflow.com/survey/2019

01/
02/
03/
04/
SQL & Relational
NoSQL
Analytics & Reporting
Summary & More Resources
AGENDA

5
• Relational
• E.F. Codd invented the relational model
• Alpha
• SQL
• Created by Don Chamberlin & Raymond Boyce
• Designed to be English-friendly
• "SQL" and "relational" are now synonyms
Relational and SQL

6
• Impedance mismatch
• Scaling
• Inflexibility
Criticisms of SQL/relational

7
Impedance mismatch
ID Username DateCreated
1 mgroves 2019-06-13
2 agroves 2019-06-14
. . .
. . .
CartID Item Price Qty
1 hat 12.99 1
1 socks 11.99 1
2 t-shirt 15.99 1
. . . .
. . . .
public class ShoppingCart
{
public int Id;
public string Username;
public List<Items> Items;
}
ShoppingCart
ShoppingCartItems

9
Inflexibility
Billing
ConnectionsPurchases
Contacts
Customer

10
• A relational database may be…
Disclaimer!

13
Example 1
{
"callsign": "UNITED",
"country": "United States",
"name": "United Airlines",
"type": "airline"
}
document key: airline_5209

14
Example 2
document key: route_55758
{
"airline": "UA",
"airlineid": "airline_5209",
"destinationairport": "ORD",
"distance": 1050.394306634423,
"equipment": "ER4 ERJ",
"schedule": [
{ "day": 0, "flight": "UA479", "utc": "15:05:00" },
{ "day": 1, "flight": "UA842", "utc": "02:27:00" },
{ "day": 1, "flight": "UA252", "utc": "03:00:00" },
// ... etc ...
],
"sourceairport": "CMH",
"stops": 0,
"type": "route"
}

15
• Get by key
• Set by key
• Delete by key
• Other "operational" query
NoSQL basic operations

16
Problems:
1. Large amounts of data
2. Queries against the data could impact
operations
What about reporting and analytics?

18
Operational vs Analytics vs Operational Analytics

19
Fewer queries
Operational Analytics workload
Adhoc
Could be complex Performance is nice-to-have

20
How are operational analytics done?

22
Answer 2: Export to relational
Data
ETL
SQL

23
Answer 3: Hadoop?
https://medium.com/@ylashin/big-data-using-hdinsight-a-journey-in-the-zoo-ecosystem-c78b913a5ed9

25
SQL Example
ID foo bar baz
1 matt groves qux
2 ali groves notqux
3 emma groves notqux
mytable
SELECT foo, bar
FROM mytable
WHERE baz = 'qux'

26
SQL++ Example
key: 1
{
"foo" : "matt",
"bar" : "groves",
"baz" : "qux"
}
key: 2
{
"foo" : "ali",
"bar" : "groves",
"baz" : "notqux"
}
key: 3
{
"foo" : "emma",
"bar" : "groves",
"baz" : "notqux"
}
mybucket
SELECT foo, bar
FROM mybucket
WHERE baz = 'qux'

28
• JOIN
• UNION
• aggregation / GROUP BY
• SELECT
• LET
• LIMIT
• ORDER BY
• etc…
SQL++ is backwards compatible

30
Superpower: Nested Objects
key 1
{
"name" : "matt",
"address" : {
"street" : "White Rd",
"city" : "Grove City",
"state" : "OH"
}
}
key 2
{
"name" : "emma",
"address" : {
"street" : "High St",
"city" : "Columbus",
"state" : "OH"
}
}
SELECT address.city
FROM myusers
myusers

31
Superpower: arrays
key 1
{
"name" : "matt",
"favoriteFoods" : [
"pizza",
"cheesecake",
"donuts"
]
}
key 2
{
"name" : "emma",
"favoriteFoods" : [
"donuts",
"Lucky Charms",
"chicken"
]
}
SELECT favoriteFoods[1]
FROM myusers
myusers

32
Superpower: UNNEST
key 1
{
"name" : "matt",
"favoriteFoods" : [
"pizza",
"cheesecake",
"donuts"
]
}
SELECT food, u.name
FROM myusers u
UNNEST u.favoriteFoods food;
myusers
[
{
"food": "pizza",
"name": "matt"
},
{
"food": "cheesecake",
"name": "matt"
},
{
"food": "donuts",
"name": "matt"
}
]

33
Superpower: Quantification
key 1
{
"name" : "matt",
"favoriteFoods" : [
"pizza",
"cheesecake",
"donuts"
]
}
key 2
{
"name" : "emma",
"favoriteFoods" : [
"donuts",
"Lucky Charms",
"chicken"
]
}
SELECT u.name
FROM eftest u
WHERE ANY f
IN u.favoriteFoods
SATISFIES f == 'pizza'
END;
myusers

34
• Couchbase
• AsterixDb
• Apache Drill
• Others coming soon?
SQL++ Implementations

35
Implementation 1: Couchbase
SQL++

36
Implementation 2: AsterixDB

37
Implementation 3: Apache Drill

39
No
NoSQL doesn't mean NoSQL anymore
++SQL

40
SQL++ is SQL with JSON Superpowers

41
Minimize your ETL, maximize your SQL skills
ETL
👎
SQL
👍

42
• E.F. Codd original research paper
• http://db.dobo.sk/wp-content/uploads/2015/11/Codd_1970_A_relational_model.pdf
• The Free Lunch is Over
• http://www.gotw.ca/publications/concurrency-ddj.htm
• Original SEQUEL paper
• https://dl.acm.org/citation.cfm?id=811515
Resources: SQL/scaling

43
• UCSD
• http://forward.ucsd.edu/sqlpp.html
• The SQL++ Query Language
• https://arxiv.org/abs/1405.3631
Resources: UCSD Research

44
• Book: SQL++ for SQL Users
• Amazon: https://www.amazon.com/SQL-Users-Tutorial-Don-Chamberlin/dp/0692184503/
• Free PDF: https://resources.couchbase.com/sql_tutorial
• Videos
• NoSQL and SQL++, two sides of the same coin:
https://www.youtube.com/watch?v=KGKiSyJa0-k
• Tech Panel on Query Language Evolution:
https://www.youtube.com/watch?v=LAlDe1w7wxc
Resources: Don Chamberlin

45
•@mgroves
•twitch.tv/matthewdgroves
•forums.couchbase.com
•Find me after this session!
Resources: Me!

Intro to SQL++ - Detroit Tech Watch - June 2019

More Related Content

Similar to Intro to SQL++ - Detroit Tech Watch - June 2019

More from Matthew Groves

Recently uploaded

Intro to SQL++ - Detroit Tech Watch - June 2019

Editor's Notes