In Which We Begin at the Beginning

Post author By dave
Post date August 7, 2009
86 Comments on In Which We Begin at the Beginning

Part 1: In Which We Begin at the Beginning

Preface

Someone recently posted to the Twisted mailing list asking for something like the “Twisted introduction for people on a deadline”. Full disclosure: this isn’t it. On the spectrum of introductions to Twisted and asynchronous programming in Python, it may be on the exact opposite end. So if you don’t have any time, or any patience, this isn’t the introduction you are looking for.

However, I also believe that if you are new to asynchronous programming, a quick introduction is simply not possible, at least if you are not a genius. I’ve used Twisted successfully for a number of years and having thought about how I initially learned it (slowly), and what I found difficult, I’ve come to the conclusion that much of the challenge does not stem from Twisted per se, but rather in the acquisition of the “mental model” required to write and understand asynchronous code. Most of the Twisted source code is clear and well written, and the online documentation is good, at least by the standards of most free software. But without that mental model, reading the Twisted codebase, or code that uses Twisted, or even much of the documentation, will result in confusion and headache.

So the first parts of this introduction are designed to help you acquire that model and only later on will we introduce the features of Twisted. In fact, we will start without using Twisted at all, instead using simple Python programs to illustrate how an asynchronous system works. And once we get into Twisted, we will begin with very low-level aspects that you would not normally use in day-to-day programming. Twisted is a highly abstracted system and this gives you tremendous leverage when you use it to solve problems. But when you are learning Twisted, and particularly when you are trying to understand how Twisted actually works, the many levels of abstraction can cause troubles. So we will go from the inside-out, starting with the basics.

And once you have the mental model in place, I think you will find reading the Twisted documentation, or just browsing the source code, to be much easier. So let’s begin.

The Models

We will start by reviewing two (hopefully) familiar models in order to contrast them with the asynchronous model. By way of illustration we will imagine a program that consists of three conceptually distinct tasks which must be performed to complete the program. We will make these tasks more concrete later on, but for now we won’t say anything about them except the program must perform them. Note I am using “task” in the non-technical sense of “something that needs to be done”.

The first model we will look at is the single-threaded synchronous model, in Figure 1 below:

This is the simplest style of programming. Each task is performed one at a time, with one finishing completely before another is started. And if the tasks are always performed in a definite order, the implementation of a later task can assume that all earlier tasks have finished without errors, with all their output available for use — a definite simplification in logic.

We can contrast the synchronous model with another one, the threaded model illustrated in Figure 2:

In this model, each task is performed in a separate thread of control. The threads are managed by the operating system and may, on a system with multiple processors or multiple cores, run truly concurrently, or may be interleaved together on a single processor. The point is, in the threaded model the details of execution are handled by the OS and the programmer simply thinks in terms of independent instruction streams which may run simultaneously. Although the diagram is simple, in practice threaded programs can be quite complex because of the need for threads to coordinate with one another. Thread communication and coordination is an advanced programming topic and can be difficult to get right.

Some programs implement parallelism using multiple processes instead of multiple threads. Although the programming details are different, for our purposes it is the same model as in Figure 2.

Now we can introduce the asynchronous model in Figure 3:

In this model, the tasks are interleaved with one another, but in a single thread of control. This is simpler than the threaded case because the programmer always knows that when one task is executing, another task is not. Although in a single-processor system a threaded program will also execute in an interleaved pattern, a programmer using threads should still think in terms of Figure 2, not Figure 3, lest the program work incorrectly when moved to a multi-processor system. But a single-threaded asynchronous system will always execute with interleaving, even on a multi-processor system.

There is another difference between the asynchronous and threaded models. In a threaded system the decision to suspend one thread and execute another is largely outside of the programmer’s control. Rather, it is under the control of the operating system, and the programmer must assume that a thread may be suspended and replaced with another at almost any time. In contrast, under the asynchronous model a task will continue to run until it explicitly relinquishes control to other tasks. This is a further simplification from the threaded case.

Note that it is possible to mix the asynchronous and threaded models and use both in the same system. But for most of this introduction, we will stick to “plain vanilla” asynchronous systems with one thread of control.

The Motivation

We’ve seen that the asynchronous model is simpler than the threaded one because there is a single instruction stream and tasks explicitly relinquish control instead of being suspended arbitrarily. But the asynchronous model is clearly more complex than the synchronous case. The programmer must organize each task as a sequence of smaller steps that execute intermittently. And if one task uses the output of another, the dependent task must be written to accept its input as a series of bits and pieces instead of all together. Since there is no actual parallelism, it appears from our diagrams that an asynchronous program will take just as long to execute as a synchronous one, perhaps longer as the asynchronous program might exhibit poorer locality of reference.

So why would you choose to use the asynchronous model? There are at least two reasons. First, if one or more of the tasks are responsible for implementing an interface for a human being, then by interleaving the tasks together the system can remain responsive to user input while still performing other work in the “background”. So while the background tasks may not execute any faster, the system will be more pleasant for the person using it.

However, there is a condition under which an asynchronous system will simply outperform a synchronous one, sometimes dramatically so, in the sense of performing all of its tasks in an overall shorter time. This condition holds when tasks are forced to wait, or block, as illustrated in Figure 4:

Figure 4: blocking in a synchronous program

In the figure, the gray sections represent periods of time when a particular task is waiting (blocking) and thus cannot make any progress. Why would a task be blocked? A frequent reason is that it is waiting to perform I/O, to transfer data to or from an external device. A typical CPU can handle data transfer rates that are orders of magnitude faster than a disk or a network link is capable of sustaining. Thus, a synchronous program that is doing lots of I/O will spend much of its time blocked while a disk or network catches up. Such a synchronous program is also called a blocking program for that reason.

Notice that Figure 4, a blocking program, looks a bit like Figure 3, an asynchronous program. This is not a coincidence. The fundamental idea behind the asynchronous model is that an asynchronous program, when faced with a task that would normally block in a synchronous program, will instead execute some other task that can still make progress. So an asynchronous program only “blocks” when no task can make progress and is thus called a non-blocking program. And each switch from one task to another corresponds to the first task either finishing, or coming to a point where it would have to block. With a large number of potentially blocking tasks, an asynchronous program can outperform a synchronous one by spending less overall time waiting, while devoting a roughly equal amount of time to real work on the individual tasks.

Compared to the synchronous model, the asynchronous model performs best when:

There are a large number of tasks so there is likely always at least one task that can make progress.
The tasks perform lots of I/O, causing a synchronous program to waste lots of time blocking when other tasks could be running.
The tasks are largely independent from one another so there is little need for inter-task communication (and thus for one task to wait upon another).

These conditions almost perfectly characterize a typical busy network server (like a web server) in a client-server environment. Each task represents one client request with I/O in the form of receiving the request and sending the reply. And client requests (being mostly reads) are largely independent. So a network server implementation is a prime candidate for the asynchronous model and this is why Twisted is first and foremost a networking library.

Onward and Upward

This is the end of Part 1. In Part 2, we will write some network programs, both blocking and non-blocking, as simply as possible (without using Twisted), to get a feel for how an asynchronous Python program actually works.

86 replies on “In Which We Begin at the Beginning”

Really well written and clear.

My god, you can write. Not that I’m surprised–this is just the longest sample I’ve ever seen from you. Lovely.

Thank you, ma’am!

Hi. Nice work!

A typo: s/under the synchronous model a task will continue/under the asynchronous model a task will continue/

Terry

Good catch, thanks!

Wow! Nice work!

Really Awesome

Thank you.

[…] May 18, 2010 by maitscha Found an very interesting block about asynchronous programming and the Twisted web-framework: An Introduction to Asynchronous Programming and Twisted. […]

[…] originaalversioon: http://krondo69349291.wpcomstaging.com/blog/?p=1209 59.421506 […]

This is written in a easy to understand manner. The emphasis that twisted is a networking library and foremost suitable for writing network server should give some context to person desiring to study twisted. Also thinking about a network server and how the life of a network server operates in waiting state and servicing clients should give some idea too. Thank you.

hi, thanx for the blog; i found it really useful and it helped me with my studies. I’d just like to put one question forward, could you post some more blogs about different variations of programming?

Glad you like it. I’m not sure what you mean by variations of programming, though, could you explain?

I would love to see this in a PDF type file format, thus allowing it be read without network connections (kindle, nock…) would work nice for this type of document.

Have a nice day

Hi Donald, as soon as I finish the series (2 more to go), I’m going to look into doing just that.

thanks,
dave

I have been looking for such model comparisons for quite sometime. This is the best insight, I can say. Many thanks!

You are welcome!

This is very well written with good examples and explanations. There were many times I would read a sentence and then have a question, but I kept reading and you just seemed to have answered all of mine! Thank you, nice work!

You are quite welcome, thanks for the kind words.

Oh, and I would like to see a pdf version of this too. I don’t always have an internet connection and it would be very convenient if I could read this offline.

I’m trying out the joliprint service to see how it does. Each post has a joliprint
link at the bottom of the post (but before the comments) that makes a pdf of
the page on the fly. Let me know how that works for you.

It is an absolute pleasure to read your tutorials.

Thank you for such an awesome work!

Thank you very much!

Dave, you have done Twisted newbies and the Python community a big favour with this exceptional tutorial, have no doubt about it. For the first time, I feel I have a foot in the door of this fascinating but not easy to approach technology. So, simply, thank you.

Theodore

You are welcome, Theodore.

hello,dave.I’m a graduate from china.
Thank you for your good articles,which help me a lot.
I have a idea that to translate your articles to chinese and put them on my blog of sina.
Is there any question about copyright?

Hello! You are quite welcome to translate the articles, thank you. Please send me a link to your translation and I will post it on the main index page of the introduction.

Thank you! I’m translating it these days.I will send you it when I finish them.
And thank you for your perfect articals again!

Cool, thanks very much.

A very well written, simple to understand article. It was a great read.

Thank you!

[…] 发表于 2011 年 05 月 18 日由 notedit 原文:http://krondo69349291.wpcomstaging.com/blog/?p=1209 作者:dave 译者:notedit 日期:2011.05.18 […]

Well done, Dave. Great article.I wish I could have seen it earlier.
Best Regards from China

Thank you very much.

Really good article!!

I have the twisted networking essentials book from O’Reilly, but the fundamentals about asynchronous programming you explain here really increased my insight on the subject, and felt like a more natural, proper introduction into the library. It’s also very well written, clear and easy to understand. After reading through this I’d feel guilty not writing a “Thank You” comment! Anyway, thanks alot Dave, this is by far the best intro I found on the topic.

Thank you, you are welcome.

A simple yet excellent article.

Neat work.

Thanks.

I take my hat off to you for your awesome skills in abstracting technology in such a well-structured pattern!

Thanks very much!

This is a very useful series of articles. You should compile them, with an index page. You should also have them linked from the Twisted documentation.

Thank you! Here is the index page:

http://krondo69349291.wpcomstaging.com/?page_id=1327

and here is the link on Twisted to my introduction:

http://twistedmatrix.com/trac/wiki/Documentation

Beautifully written !!

Thank you, I appreciate that!

[…] This is an excellent explanation of the single threaded asynchronous model of programming. It’s refers to Twisted, but many of the points apply generally. Like this:LikeBe the first to like this post. Posted in Uncategorized | Leave a Comment » […]

Hi Dave, Great Tut!

“””
The programmer must organize each task as a sequence of smaller steps that execute intermittently. And if one task uses the output of another, the dependent task must be written to accept its input as a series of bits and pieces instead of all together.
“””

any examples for this concept in your tutorial?

Thanks!

Hello, glad you liked the tutorial. All the later examples that use asynchronous programming have to deal with their input in small bits as they come in (the examples use artificially slow servers to illustrate the concepts).

Thank you *VERY* *MUCH*, Dave ! 🙂
And a big thanks for publishing this as PDF, as well.

– a guy

You are welcome, glad it was helpful.

My teacher! This is where I will adapt myself to async programming. I was sooo frustrated with myself as it took soo long for me to fix a bug, only because I was confused and completely lost in the maze of callbacks and ErrBacks.

Root cause is not my lack of understanding of twisted, but not having the mindset for async programming. This will really help shorten my learning curve. Can’t thank you enough for your time and efforts

Have fun!

[…] In Which We Begin at the Beginning […]

Thanks!!!!

Hi, Ive been following your Twisted tutorials for the last couple of days. I would like to say that you have done a great job by writing a nice easy to understand Twisted tutorial.

I intend to write an Android App. Can I use Twisted for Android programming? Could you point me to some resources? Thanks in advance!

Hello! I actually have no experience whatsoever with Android programming so I’m not really sure. I believe there is a Python implementation for it, though, so I imagine it ought to be possible.

Great Piece of work!!

Thanks very much!

Great tutorial! You made it easy for me to understand async concepts in python/twisted 🙂

I’m writing a report about async programming for school.
Would you allow me to use the figures from your tutorial in it?

Of course I will add your article in the references list 🙂

Thank you

Yes, please feel free to do so. And you are welcome!

Still one of the most awesome posts on network programming ever! Thank You!!!!

And thank you!

[…] Introduction to Asynchronous Programming and Twisted […]

many thanks for your blog. Amazing.

Thank you!

Someone on HN asked for instances of excellent tutorials and I remembered this one years later 🙂

Thank you very much 🙂

I dint know head or tails about the theory behind async programming and twisted framework.
Extremely helpfull Article…Thank you for helping me build the right mental model!!

You are welcome, glad it was helpful!

[…] 本部分原作参见: dave http://krondo69349291.wpcomstaging.com/?p=1209 […]

A dog who code 🙂 nise. I’ll read every line of that as my learning way. I knew twisted is good by mentions and so, but nevermind how much special it is and why do i see it many many times 🙂 thank you for your time and knowledge shared. For someone who is not English like me it’s not so hard to understand you, with pics. You wasted +2h just writting 😛

Of course joking a bit, nice and well redacted explanations.

Thanks very much, glad you like it!

[…] In Which We Begin at the Beginning […]

Excellent explanation.

Damn this tutorial is great.
Will bookmark this great website.
Only problem I have is this reply was hard to do, since there were so many others to scroll through 😉

Thank you, glad you liked it!

alert(‘plzwork i want da hax’)

script>

I hope you keep it refreshed with latest changes in the twisted library so that whoever reads it whenever.. its not outdated for him/her.
Thanks for this though, Much needed.

You are welcome! I don’t keep it updated, though, I just don’t have the time.

[…] This document is not for node(it’s for a python asynchronous framework “twisted”) but might be helpful for you. […]

So clear and nice explanation about the topic.

Thanks a bunch!

[…] Which We Begin at the Beginning – krondo: http://krondo69349291.wpcomstaging.com/in-which-we-begin-at-the-beginning 中文译文版: […]

Part 1: In Which We Begin at the Beginning

Preface

The Models

The Motivation

Onward and Upward

Share this:

Related

86 replies on “In Which We Begin at the Beginning”

Leave a ReplyCancel reply

Discover more from krondo