Developer's notes

These notes are not for end users but rather for developers who are interested in the design of the package.

TranscodingStream type

TranscodingStream{C,S} (defined in src/stream.jl) has three fields:

  • codec: data codec (<:C where C<:Codec)
  • stream: data stream (<:S where S<:IO)
  • state: current state (<:State).

A codec will be implemented by package developers and only a special codec Noop is defined in this package. A stream can be any object that implements at least Base.isopen, Base.eof, Base.close, Base.bytesavailable, Base.unsafe_read, and Base.unsafe_write. All mutable fields are delegated to state and hence the stream type itself is immutable.

A stream has two buffers in the state field. These are used to store pre-transcoded and transcoded data in the stream. The stream passes references of these two buffers to the codec when processing data. The following diagram illustrates the flow of data:

When reading data (`state.mode == :read`):
  user <--- |state.buffer1| <--- <stream.codec> <--- |state.buffer2| <--- stream

When writing data (`state.mode == :write`):
  user ---> |state.buffer1| ---> <stream.codec> ---> |state.buffer2| ---> stream

In the read mode, a user pull out data from state.buffer1 and pre-transcoded data are filled in state.buffer2. In the write mode, a user will push data into state.buffer1 and transcoded data are filled in state.buffer2. The default buffer size is 16KiB for each.

State (defined in src/state.jl) has five fields:

  • mode: current stream mode (<:Symbol)
  • code: return code of the last codec's method call (<:Symbol)
  • error: exception returned by the codec (<:Error)
  • buffer1: data buffer that is closer to the user (<:Buffer)
  • buffer2: data buffer that is farther to the user (<:Buffer)
  • bytes_written_out: number of bytes written to the underlying stream (<:Int64)

The mode field may be one of the following value:

  • :idle : initial and intermediate mode, no buffered data
  • :read : being ready to read data, data may be buffered
  • :write: being ready to write data, data may be buffered
  • :stop : transcoding is stopped after read, data may be buffered
  • :close: closed, no buffered data
  • :panic: an exception has been thrown in codec, data may be buffered but we cannot do anything

Note that mode=:stop does not mean there is no data available in the stream. This is because transcoded data may be left in the buffer.

The initial mode is :idle and mode transition happens as shown in the following diagram: Mode transition

Modes surrounded by a bold circle are a state in which the transcoding stream has released resources by calling finalize(codec). The mode transition should happen in the changemode!(stream, newmode) function in src/stream.jl. Trying an undefined transition will thrown an exception.

A transition happens according to internal or external events of the transcoding stream. The status code and the error object returned by codec methods are internal events, and user's method calls are external events. For example, calling read(stream) will change the mode from :init to :read and then calling close(stream) will change the mode from :read to :close. When data processing fails in the codec, a codec will return :error and the stream will result in :panic.

Shared buffers

Adjacent transcoding streams may share their buffers. This will reduce memory allocation and eliminate data copy between buffers.

If buffer2 is shared it is considered to be owned by the underlying stream by the stats and position functions.

readdata!(input::IO, output::Buffer) and flush_buffer2(stream::TranscodingStream) do the actual work of read/write data from/to the underlying stream. These methods have a special pass for shared buffers.

Noop codec

Noop (NoopStream) is a codec that does nothing. It works as a buffering layer on top of the underlying stream. Since NoopStream does not need to have two distinct buffers, buffer1 and buffer2 in the State object are shared and some specialized methods are defined for the type. All of these are defined in src/noop.jl.