(⬑Table of Contents) (⬑API Index)

Output Buffering

This API may be installed using:

whcl install-api ob

or calling whcl_install_ob() from C code.

It adds a whcl[ob] object with the API described below.

Jump to...

OB: Output Buffering
OB Methods
Notes & Caveats

Output Buffering

One of cwal's core features is the ability for the client to specify an "output channel" - a client-defined callback function through which all script-generated output "should" (by well-behaved clients) be sent. The "ob" API builds upon that, providing functionality similar to PHP's ob_start() family of functions. This allows capturing output for emitting later on. e.g. as one might do when buffering HTTP payload output so that HTTP headers can be sent before the payload (coughthe CGI modulecough).

In short, one "pushes" a buffer on the stack, and then any output which is generated via the normal output channel (specifically, output going through the C-level cwal_output() family of functions) are intercepted and appended to the current buffer. When the client is finished, one pops the buffer from the stack and output resumes its regularly programmed course. This API allows the client to discard the buffered output, fetch it as a string or Buffer, or "flush" it, all demonstrated and described below.

Reminder to self: we can't pass the underlying Buffers back to scripts because they are necessarily created outside of the Value management system, without an attached Value instance, because their lifetimes would otherwise be unduly problematic for many legal/conceivable use cases unless the client managed all buffer references himself (e.g. ob push would return the buffer and expect the client to manage it).

This API can be installed into a client-side interpreter using whcl_install_ob(). whclsh installs it as whcl[ob].

Example usage (the indentation matches the buffering levels):

decl -const ob whcl[ob]
ob push
assert 1 === [ob level];
    echo "This will be flushed to stdout."
    ob flush
    echo "level 1"
    decl -const v1 [ob take-string]
    ob push
        echo "This will be flushed to level 1."
        ob flush
        echo "level 2"
        decl -const v2 [ob take-string]
    ob pop
    decl -const v1b [ob take-string]
    echo "discarded"
    #ob clear; # not needed b/c pop will do this
ob pop
assert $v1 == 'level 1\n';
assert $v2 == 'level 2\n';
assert $v1b == 'This will be flushed to level 1.\n';

That outputs only the following:

This will be flushed to stdout.

While this example uses echo for generating output, output capturing applies when using any routines which use (perhaps indirectly) cwal_output() (or one of its sibling APIs) to generate their output. It does not (cannot) apply to code output via lower-level C routines like puts(3) or printf(3), and script binders are strongly discouraged from using those in script bindings. Using cwal_output() routes the data through the same channel the rest of the script world uses, which is far more flexible than using lower-level output routines (e.g. it allows us to implement buffering, send all output to a given file, or disable output altogether).

It is important to note that the memory owned by the underlying buffer(s) is managed outside of the interpreter's garbage collection system, a side effect of which is that push/pop operations may span script-side scopes. The get-string method (and similar ones) transform the underlying content into something managed by script-space, making that copy subject to the normal lifetime/garbage collection rules (the scope calling the method will be the initial owning scope for newly-created values).

The capture method (added several years after the above text was written) greatly simplifies the process of keeping the OB levels consistent, effectively removing the onus of popping from the user and guaranteeing that the levels stay consistent in the face of exceptions and similar error conditions.

OB Methods

The OB object's methods are implemented as subcommands of the OB object. They do not have script-accessible handles and cannot be reached from script-space except by invoking them as subcommands of the OB object.

capture

Usage: capture callback [int captureMode=-1 | buffer captureTarget]

This convenience routine pushes an OB level, runs a callback function or evals a callback string, captures the output of all OB levels pushed since it was called, then restores the OB level to its pre-call state.

Throws if, after the callback, the OB level is lower than it was before the callback was called/eval'd. Such a case indicates serious mismanagement of the OB levels. The callback may push any number of OB levels but is not required to pop them: if it leaves extra OB levels on the stack, this function will capture them and pop them.

If the 2nd argument is a buffer, all captured output is appended to that buffer and that buffer is returned. If it's not a buffer, it's interpreted as an integer with the same semantics as pop's argument but with a different default value: if it's negative (the default) then the captured buffered output is returned as a string, positive returns the result as a new buffer, and 0 means to simply discard the result.

Managing OB levels is easier and safer with this approach, compared to manually managing push/pop levels, because it keeps the OB levels consistent even if the callback triggers an script-level assert, exit, fatal, exception, an out-of-memory failure, C-level interruption via whcl_interrupt() (typically via Ctrl-C), or a similar "flow-control event."

clear

Usage: clear

Discards the contents of the current buffering level but leaves the buffer in place. Returns this. Throws if buffering is not active.

flush

Usage: flush

Pushes the current contents of the current buffer level down one level, such that it either gets appended to another buffer (if buffering is nested) or goes to the default configured output channel (if the first buffer level is flushed). Empties out the current buffer contents but leaves the buffer in place (does not change the level). Returns this. Throws if buffering is not active.

get-string

Usage: get-String

Returns the current buffer contents as a string and leaves the buffer unmodified. Throws if buffering is not active.

level

Usage: level

Returns the current buffering level, or 0 if not currently buffering.

pop

Usage: pop [int takePolicy=0]

Pops the current level of buffering. Must only be called after a corresponding call to push (or it will throw). If passed no arguments or passed a falsy value then it discards any buffered data, freeing up its memory and returning this. If passed an numeric value greater than 0 it returns the buffered contents as a Buffer (exactly as for take-buffer). If passed a numeric value less than 0, it returns the contents as a String (exactly as for take-string).

Throws if buffering is not active.

push

Usage: push

Pushes a level to the OB stack. Must be accompanied by a matching call to pop unless (special case) the current OB level is being captured, in which case capture will pop it if the script does not do so. Returns this.

take-buffer

Usage: takeBuffer

"Takes" the underlying buffer away from the current buffering level, effectively clearing it and transferring the contents, in the form of a Buffer value, to the caller. Does not change the buffering level, and generating more output at this level will create a new buffer to store it in.

Throws if buffering is not active.

Sidebar: we cannot return a handle to the underlying Buffer without taking it away from the OB layer because it is not associated with a Value instance (because the lifetimes cannot be sanely managed that way).

take-string

Usage: take-string

Equivalent to calling get-string then clear. This does not change the buffering level, only transfers output buffered at the current buffer level (including data flush'd into it from a subsequently-pushed OB scope). Generating more output at this level will create a new buffer to store it.

Throws if buffering is not active.

Notes & Caveats

It's important that clients keep the buffering levels correct, meaning that they must call pop one time for each time they call push. Failing to do so can cause output to get "lost" or unintentionally hidden from view. It is often necessary to use catch blocks to ensure that a pending call to pop is not missed, or a thrown exception's output might be hidden by that buffering level. Or...
Using capture, rather than push and pop, makes guaranteeing consistent OB levels trivial, even in cases which script code cannot respond to, e.g. when the stack is currently unwinding due to a script-level exit or failed assert.
While the API could, in principle, support various types of buffering modules (like PHP's "gz" module), its infrastructure currently does not do so. Potentially useful, yes, but not high on the priority list.

whcl: Output Buffering

(⬑Table of Contents) (⬑API Index)

Output Buffering

Output Buffering

OB Methods

capture

clear

flush

get-string

level

pop

push

take-buffer

take-string

Notes & Caveats