Tiezheng Li
Tiezheng.Li@twosigma.com
Two Sigma Investments, LLC
April 27, 2018
The Beaker Extensions for Jupyter: BeakerX
Agenda
Beaker Notebook
From Beaker Notebook to BeakerX
BeakerX Live Demo
OUR VISION FOR DATA SCIENCE
● DISCOVERABLE DATA
● DATA ANALYSIS + MODELING
● SCALABLE + DISTRIBUTED COMPUTE
● PUBLICATION + COLLABORATION
BEAKER, AN INTRODUCTION
LANGUAGE MATTERS
BeakerX - Tiezheng Li
LIFE OF BEAKER
Oct 2013: Internal GA
May 2014: Open Source Beta
Mar 2015: R, Scala, Java, Python2/3 support
Jun 2015: PySpark, SparkR, Clojure, Kdb support
Apr 2016: External Beaker Lab Alpha
Nov 2016: BeakerX Pivot
Aug 2017: BeakerX RC1
OPEN SOURCE WORLD
nbconvert
nbviewer
nbpresent
nbgrader
Jupyter Hub
nbdime
nbmanager
binder
Jupyter Lab
FORK? MERGE? JOIN?
THE PIVOT
WE DID IT!
94%
1463
213
BeakerX: A unique addition to the Jupyter Ecosystem
● Time Series Visualizations
● JVM Kernels
● Interactive Tables
● Collaborative Publication
● True Polyglot Analysis (in progress)
● Data Discovery (in progress)
DEMO
Future Work:
● Migration to Jupyter Lab
● Spark deep integration
● True Polyglot Analysis
● Data Discovery
● and more … !
THANK YOU

Editor's Notes

  • #2: Good morning, and welcome to this session, "The Beaker Extensions for Jupyter: BeakerX." Before we begin, let me introduce myself and what I do. My name is Tiezheng Li, and I am a software engineer at Two Sigma. Since joining Two Sigma, I have been working on a team that builds products for modelers that make data easy to discover, consume, publish, and visualize at Two Sigma. BeakerX is one of our approaches to accomplishing this goal.