Trying to understand generators / yield in node.js - what executes the asynchronous function?

Duke Dougal picture Duke Dougal · Jul 8, 2013 · Viewed 16.7k times · Source

Node.js now has generators.

My understanding is that generators can be used to write code that appears to be much more linear and avoids callback hell and pyramid of doom style coding.

So to this point, my understanding is that inside a generator, code executes until it reaches a "yield" statement. Execution of the generator function suspends at this point. The yield statement specifies a return value which may be a function. Typically this would be a blocking I/O function - one that would normally need to be executed asynchronously.

The yield's return function is returned to whatever called the generator.

My question is, what happens at this point? What exactly executes the blocking I/O function that the yield returned?

Is it correct that to write generator/yield code that appears to be linear, there needs to be a specific sort of function that is calling the generator, a function that loops through the generator and executes each asynch function returned by the yield and returns the result of the asynch function back into the generator?

It's still not clear to me exactly how the asynch function returned by the yield is executed. If it is executed by the function that calls the generator, is it executed asynchronously? I'm guessing so because to do otherwise would result in blocking behaviour.

To summarise my questions:

  1. To write "linear" asynch code with generators, is it necessary for there to be a calling function that iterates over the generator, executing yielded functions as callbacks and returning the result of the callback back into the generator?
  2. If the answer to question 1 is yes, the exactly how are the yielded functions executed - asynchronously?

Can anyone offer a better overview/summary of how the whole process works?

Answer

Bruno Jouhier picture Bruno Jouhier · Sep 19, 2013

When writing async code with generators you are dealing with two types of functions:

  • normal functions declared with function. These functions cannot yield. You cannot write async code in sync style with them because they run to completion; you can only handle asynchronous completion through callbacks (unless you invoke extra power like the node-fibers library or a code transform).
  • generator functions declared with function*. These functions can yield. You can write async code in sync style inside them because they can yield. But you need a companion function that creates the generator, handles the callbacks and resumes the generator with a next call every time a callback fires.

There are several libraries that implement companion functions. In most of these libraries, the companion function handles a single function* at a time and you have to put a wrapper around every function* in your code. The galaxy library (that I wrote) is a bit special because it can handle function* calling other function* without intermediate wrappers. The companion function is a bit tricky because it has to deal with a stack of generators.

The execution flow can be difficult to understand because of the little yield/next dance between your function* and the companion function. One way to understand the flow is to write an example with the library of your choice, add console.log statements both in your code and in the library, and run it.