Week 9 - Asynchronous Tasks

Uploaded by

spheresponsor

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

28 views37 pages

Week 9 - Asynchronous Tasks

Uploaded by

spheresponsor

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 37

Asynchronous Tasks

Web servers ●
●
How does a web server work
Threads and background tasks
● Long running tasks
Web server
Simplest possible HTTP server

● Open port 80 in “listen” mode - wait for incoming connections

● If incoming connection, read text, look for request
● Send back a response
Blocking connections with Flask
● Flask in “non-threaded” mode
○ https://replit.com/@nchandra/FlaskBlocking#main.py
● vs. Threaded mode
○ Default operation of Flask
Threaded web server
● Threaded server
○ Accept incoming request
○ Immediately start a thread to handle request
○ Go back and listen for next request
● Limitations
○ Each thread consumes resources
○ Depends on OS for handling parallel / concurrent execution
● Note: Threads are concurrent - parallelism depends on hardware
Blocking server
● Client blocks until server responds
● Can be bad for interactivity
● Need not block other clients
○ Depends on threading
Long running tasks
Example: face recognition on uploaded photos

● User uploads photos

● Server runs face detection on each photo
● Then face recognition against known database
● Alert when match found
Face recognition task problems
Blocking Threading

● User uploads photo, but gets no response ● Only one user can upload a photo at a
till task complete time?
● Cannot navigate away, do not know ● Large photos block server for long time
response ● Uncontrolled thread creation drags down
server performance
General problem
● Should web server directly run compute intensive tasks?
○ Or stick to handling application logic, rendering, file serving?
● Can tasks be handed off to outside servers
○ Specialized for types of compute
○ Different scaling algorithms than web
● How should web server and compute servers communicate?
○ Automatically handle scaling
○ Allow easy task distribution
Asynchronous Task Frameworks
Goals:

● User can define set of tasks

● Web server can “dispatch” tasks to be executed later
● Asynchronous completion and updating possible
When to use?
● Response to user does not depend on execution of task
○ Example: send email - can display a “sending” message and later update status
● Example of when NOT to use:
○ API fetch: response must be based on result of API query
○ async task will not help since you will need to block and wait for response
● Note: this is NOT the same as async on frontend!
○ Async frontend with UI reactive update is still useful
○ But the backend process should return with the correct response
Requirements
● Messaging / Communication system
○ Message queues
○ Brokers / Backends
● Execution system
○ Threads / coroutines / greenlets …
○ concurrent models
○ Can be in another language, runtime, …

Example: Celery for Python

Message Queues
● Messaging
● Channels, Queues,
Exchanges…
● Publish/Subscribe
Client-Server
Server-Server
Communication between servers
● Many-to-many
○ Point to point: too many connections
● Scaling
○ Add new servers
● Asymmetric
○ Not all servers need to talk to all others
○ Some produce messages, others consume
● Failure tolerance
○ Offline servers - failover
○ Busy servers - retry
Messaging
● Communicate through message queues or brokers
● Decouple message from execution
○ Server for execution sends a message - another server picks it up
● Asynchronous communication
○ No need to wait for response - or response may be delayed
● Dataflow processing
○ React to presence of messages
○ Automatically adjust to rate of processing determined by activity
● Ordered transactions
○ First-in-first-out
Message Broker
Potential benefits
● Scalability
○ Can easily add servers to consume messages as needed
● Traffic spikes
○ Messages retained in queue until processed - may be delayed, but not lost
● Monitoring
○ Easy point of reference for monitoring performance: number of messages unprocessed
● Batch processing
○ Collect messages into a queue and process them at one shot
Variants
● Message Queues
○ Mostly point to point: producer -> queue -> consumer
● Pub/Sub: Fanout
○ Producers publish without knowing who will read, multiple subscribers consume
● Message Bus
○ Analogy to hardware bus: multiple entities communicate over shared medium, addressable
● APIs / Web services
○ Direct point to point - communicate between services directly: less resilient, no storage
● Databases
○ A message is a piece of information: store in database - not normally well suited
Advanced Message Queueing Protocol - AMQP
● Standard similar to HTTP, SMTP
○ Details of how to connect, initiate transfers, establish logical connections etc
● Many open-source implementations
○ RabbitMQ, Apache ActiveMQ etc
● Broker
○ Manage transfer of messages between entities
○ “Message exchange” intermediary - clients always talk to exchange
● RabbitMQ
○ Well suited for complex message routing
Redis
● In-memory database
○ Key-value store
○ Not originally designed for messaging at all
● Pub/Sub pattern
● Very high performance due to in-memory
○ But lacks persistence - data lost on shutdown
● Excellent for small messages
○ Performance degrades for large messages
Summary
● Distributed systems need messaging
● Complex messaging patterns possible
○ Point-to-point
○ Publish / Subscribe
● Many messaging systems exist
○ One more service to install and maintain
○ Useful at scale or for long running tasks
● Most useful in context of task queues
Asynchronous
Tasks with Celery
Task Queues
How can we manage large numbers of long running tasks without interfering with
ability to respond to user requests

● User request handler “pushes” task onto a queue

○ First-in-first-out: give priority in order that tasks issued, or can have separate priorities
● Separate queue managers to handle execution of tasks and returning results

Asynchronous: in general no guarantees of timely response˛

Asynchronous Task Execution
● Language supported:
○ Python asyncio
○ JS async/await
Asynchronous Task Execution
● Language supported:
○ Python asyncio
○ JS async/await
● Guarantees of completion
● Reliable against server failure
● Ability to auto-retry
General Principles
● Pushing a task onto a queue should be faster than executing the task
○ Else not worth using a queue - just finish the task
● There should be enough worker resources to empty the queue eventually
○ Else build up backlog and eventually overflow
Potential problems
● Problems like any other distributed system
● Deadlock and related issues
○ Message system does not accept messages: block or lose data?
● Buffer sizing, overflows
Scenario: Push Queue
● Client pushes task onto server queue
● Server should start operation “immediately”
○ May be delayed based on availability of resources etc
● Closer to “real-time” operation
● Example:
○ Update friend list in social media application: push updates to DB for all friends
○ Send emails: push emails onto queue to be sent out individually
Scenario: Pull Queue
● Clients can push tasks at any time
● Server “polls” queue at regular intervals
● Better suited to “batch-mode” operation
○ Generally not real-time
● Example:
○ Batch update of high scores in gaming server: periodic updates
○ Dashboard updates - process many log entries in batch and update periodically
Pull mechanisms
● Polling
○ Periodically check on state of queue
○ CPU / network intensive - repetitive function calls
● Long poll
○ Server keeps connection open until data present
○ Client blocks until data received
Examples - High End
● Google AppEngine
○ TaskQueue - APIs
● Tencent cloud
● AWS
○ SQS - simple queue service - messaging
○ worker tasks implemented separately
Examples - General Use
● Celery - Python library
● RQ - Redis Queue
● Huey, Django-carrot, …

We are mostly interested in Python APIs, but exist for most languages

● Messaging systems are language independent

● Task queue builds on top of message system + language
Celery
● Python package for handling asynchronous tasks
● Requires a separate broker for messaging
● Also a backend for collecting and storing results
● Multiple celery instances can “auto-discover” through the messaging system
● Abstracts away the messaging system to focus on tasks
Using Celery
● Problem: multiple moving parts
○ Message broker
○ Result collector
○ Celery instance to run workers
○ Actual code
● Installing and managing needs care
● Use when needed
● Can use on platforms like replit
○ Requires a little extra work

15 441 P3 Writeup Fall2022
No ratings yet
15 441 P3 Writeup Fall2022
8 pages
Learn Multithreading with Modern C++
From Everand
Learn Multithreading with Modern C++
James Raynard
No ratings yet
System Design Interview
No ratings yet
System Design Interview
4 pages
Pick N Mix - DJUGL Sep 09
No ratings yet
Pick N Mix - DJUGL Sep 09
56 pages
Distributed Systems Chapter 3-Processes
No ratings yet
Distributed Systems Chapter 3-Processes
33 pages
Ds Chapter 3-Processes (1)
No ratings yet
Ds Chapter 3-Processes (1)
31 pages
Distributed Systems Chapter 3-Processes
No ratings yet
Distributed Systems Chapter 3-Processes
24 pages
Con Currency and Distributed System in Python
100% (2)
Con Currency and Distributed System in Python
51 pages
Chapter 3 Processes2
No ratings yet
Chapter 3 Processes2
33 pages
Notes
No ratings yet
Notes
14 pages
ch6
No ratings yet
ch6
7 pages
PYTHON04 Nikos Kapetanos Build An IoT SaaS Using Python
No ratings yet
PYTHON04 Nikos Kapetanos Build An IoT SaaS Using Python
33 pages
Chapter 3-Processes
No ratings yet
Chapter 3-Processes
40 pages
Chapter 3-Processes
No ratings yet
Chapter 3-Processes
40 pages
PyParallel: How We Removed The GIL and Exploited All Cores
No ratings yet
PyParallel: How We Removed The GIL and Exploited All Cores
153 pages
Course Introduction: OSI Reference Model
No ratings yet
Course Introduction: OSI Reference Model
26 pages
Sisteme Distribuite Documentatie Laboratorul 2
No ratings yet
Sisteme Distribuite Documentatie Laboratorul 2
13 pages
Software Construction Assignment 3
No ratings yet
Software Construction Assignment 3
32 pages
Chapter 3-Processes
No ratings yet
Chapter 3-Processes
40 pages
Chap-3 - Process
No ratings yet
Chap-3 - Process
32 pages
Docs Celeryproject Org en 4.4.1
No ratings yet
Docs Celeryproject Org en 4.4.1
797 pages
Unit 03 Processes
No ratings yet
Unit 03 Processes
86 pages
Lect 3
No ratings yet
Lect 3
34 pages
Lamp and Rest
No ratings yet
Lamp and Rest
25 pages
DISTRIBUTED SYSTEM CH3-5
No ratings yet
DISTRIBUTED SYSTEM CH3-5
61 pages
Chapter 3-Processes2 (1) (1)
No ratings yet
Chapter 3-Processes2 (1) (1)
33 pages
Redes y Servicios11
No ratings yet
Redes y Servicios11
211 pages
Threads Revisited: Lecture Notes
No ratings yet
Threads Revisited: Lecture Notes
8 pages
howto-sockets
No ratings yet
howto-sockets
6 pages
Software Construction Assignment 3
No ratings yet
Software Construction Assignment 3
33 pages
chapter 3 edited
No ratings yet
chapter 3 edited
25 pages
Advance Python Program Unit IV
No ratings yet
Advance Python Program Unit IV
9 pages
Celery - Best Practices
No ratings yet
Celery - Best Practices
5 pages
Celery_ The Essential Python Library for Distribut
No ratings yet
Celery_ The Essential Python Library for Distribut
4 pages
Computer Networks
No ratings yet
Computer Networks
27 pages
Chapter 3 Processe
No ratings yet
Chapter 3 Processe
43 pages
Chapter 3-Processes
No ratings yet
Chapter 3-Processes
43 pages
Tanaya Thakur Shraddha Tamhane Rupali Mam
No ratings yet
Tanaya Thakur Shraddha Tamhane Rupali Mam
10 pages
DC Assignment 1
No ratings yet
DC Assignment 1
15 pages
Andrew Godwin - Reinventing Django For The Real-Time Web PDF
No ratings yet
Andrew Godwin - Reinventing Django For The Real-Time Web PDF
55 pages
Byte Python Concurrent and Parallel Programming V2
No ratings yet
Byte Python Concurrent and Parallel Programming V2
38 pages
Howto Sockets
No ratings yet
Howto Sockets
6 pages
Channels
No ratings yet
Channels
55 pages
Howto Sockets
No ratings yet
Howto Sockets
6 pages
Layered Protocols: Communication
No ratings yet
Layered Protocols: Communication
14 pages
Chapter 3, Process
No ratings yet
Chapter 3, Process
24 pages
#2 Servlets
No ratings yet
#2 Servlets
23 pages
Rabbitmq
No ratings yet
Rabbitmq
24 pages
Howto Sockets
No ratings yet
Howto Sockets
7 pages
Tech Stack
No ratings yet
Tech Stack
54 pages
AWS Application
No ratings yet
AWS Application
1 page
Celery
No ratings yet
Celery
742 pages
Chapter 3 Processes
No ratings yet
Chapter 3 Processes
37 pages
Pygrunn 2014
No ratings yet
Pygrunn 2014
44 pages
Chapter 3 Processes
No ratings yet
Chapter 3 Processes
20 pages
Client Server Concepts DNS, Telnet, FTP
No ratings yet
Client Server Concepts DNS, Telnet, FTP
30 pages
Distributed System Notes
No ratings yet
Distributed System Notes
10 pages
CS10_Communication
No ratings yet
CS10_Communication
22 pages
Distributed Object Based System
No ratings yet
Distributed Object Based System
11 pages
Kafka Developer Certified: The Essential Guide
From Everand
Kafka Developer Certified: The Essential Guide
SUJAN
No ratings yet
chapter1sensors-and-devices-jntuh-cse-iot-ii-year-ii-semester
No ratings yet
chapter1sensors-and-devices-jntuh-cse-iot-ii-year-ii-semester
22 pages
PDF&Rendition=1
No ratings yet
PDF&Rendition=1
78 pages
Cloud Platforms in Industry
No ratings yet
Cloud Platforms in Industry
18 pages
Windows C++ Leason
No ratings yet
Windows C++ Leason
191 pages
Middleware 8
No ratings yet
Middleware 8
2 pages
Intell NetStrIntel NetStructure® SS7 Protocolsucture Software Environment Programmer's Manual
No ratings yet
Intell NetStrIntel NetStructure® SS7 Protocolsucture Software Environment Programmer's Manual
45 pages
IME- Module-5 Notes
No ratings yet
IME- Module-5 Notes
26 pages
Unit 5A TL - Process To Process delivery-UDP
No ratings yet
Unit 5A TL - Process To Process delivery-UDP
31 pages
Technical Paper - SQL Server Service Broker
No ratings yet
Technical Paper - SQL Server Service Broker
12 pages
800xa 5.1 Batch Management Overview
No ratings yet
800xa 5.1 Batch Management Overview
8 pages
Pushpaja Resume
No ratings yet
Pushpaja Resume
3 pages
Asynchronous Programming in Rust 1st Edition Carl Fredrik Samson - The latest ebook edition with all chapters is now available
100% (3)
Asynchronous Programming in Rust 1st Edition Carl Fredrik Samson - The latest ebook edition with all chapters is now available
55 pages
Distributed Computing practice questions Chapter 4 pt2
No ratings yet
Distributed Computing practice questions Chapter 4 pt2
6 pages
9.1-Amazon SNS - Digital Cloud Training PDF
No ratings yet
9.1-Amazon SNS - Digital Cloud Training PDF
5 pages
OpenText Communications Center Enterprise 16.0 - Supervisor User Guide English (CCMWEBRETR160000-UGD-EN-03)
100% (1)
OpenText Communications Center Enterprise 16.0 - Supervisor User Guide English (CCMWEBRETR160000-UGD-EN-03)
44 pages
Workflow Manager
No ratings yet
Workflow Manager
64 pages
IME-module-5-notes
No ratings yet
IME-module-5-notes
27 pages
RabbitMQ in Depth 1st Edition Gavin M Roy download
100% (1)
RabbitMQ in Depth 1st Edition Gavin M Roy download
58 pages
Middleware 7
No ratings yet
Middleware 7
4 pages
All Method
No ratings yet
All Method
3 pages
How To Correlate JMS Messages (NW7.0) PDF
No ratings yet
How To Correlate JMS Messages (NW7.0) PDF
33 pages
Natural Language Processing Based Disaster Management Framework
No ratings yet
Natural Language Processing Based Disaster Management Framework
25 pages
Device Network SDK (Queue Management) - Developer Guide - V6.0.2.X - 20230330
No ratings yet
Device Network SDK (Queue Management) - Developer Guide - V6.0.2.X - 20230330
209 pages
Tcpip Used Ports
No ratings yet
Tcpip Used Ports
10 pages
Chapter 4 Communication
No ratings yet
Chapter 4 Communication
10 pages
Ch12 - Overview of Software Architecture
No ratings yet
Ch12 - Overview of Software Architecture
19 pages
03 Communication PDF
No ratings yet
03 Communication PDF
72 pages
SIB best practices-WSTE Sep 06th 2016 v5_ 09_05
No ratings yet
SIB best practices-WSTE Sep 06th 2016 v5_ 09_05
39 pages
DC IAT1
No ratings yet
DC IAT1
20 pages
INEWS Web Services API 1-2-0
No ratings yet
INEWS Web Services API 1-2-0
1 page