Packages

  • package root
    Definition Classes
    root
  • package io
    Definition Classes
    root
  • package dylemma
    Definition Classes
    io
  • package spac

    SPaC (short for "Streaming Parser Combinators") is a library for building stream consumers in a declarative style, specialized for tree-like data types like XML and JSON.

    Many utilities for handling XML and JSON data involve parsing the entire "document" to some DOM model, then inspecting and transforming that model to extract information. The downside to these utilities is that when the document is very large, the DOM may not fit in memory. The workaround for this type of problem is to treat the document as a stream of "events", e.g. "StartElement" and "EndElement" for XML, or "StartObject" and "EndObject" for JSON. The downside to this workaround is that writing code to handle these streams can be complicated and error-prone, especially when the DOM is complicated.

    SPaC's goal is to drastically simplify the process of creating code to handle these streams.

    This package contains the "core" SPaC traits: Parser, Transformer, Splitter, and ContextMatcher.

    See the xml and json subpackages (provided by the xml-spac and json-spac libraries respectively) for specific utilities related to handling XML and JSON event streams.

    Definition Classes
    dylemma
  • package interop
    Definition Classes
    spac
  • package json

    This package provides extensions to the core "spac" library which allow for the handling of JSON data.

    Rather than creating explicit classes that extend Parser, Transformer, and Splitter, this package provides type aliases and implicit extensions. For example, JsonParser[A] is just a type alias for Parser[JsonEvent, A], and JsonParser is just a call to Parser[JsonEvent].

    Implicit JsonParsers are available for each of the JSON primitive types:

    • string
    • number (expressed as Int, Long, Float, or Double)
    • boolean
    • null (expressed as None.type)

    Helpers are available for parsing JSON arrays and objects:

    • JsonParser.listOf[A] to parse an array where each value is an A
    • JsonParser.objectOf[A] to parse an object where the value for each field is an A
    • JsonParser.objectOfNullable[A] to parse an object where the value for each field is either null or an A, filtering out the nulls
    • JsonParser.fieldOf[A](fieldName) to parse a specific field from an object

    A DSL for creating json-specific ContextMatchers is provided to make it more convenient to call Splitter.json. For example:

    Splitter.json("foo" \ "bar").as[String].parseFirst

    can be used to capture rootJson.foo.bar as a String in

    {
      "foo": {
        "bar": "hello"
      }
    }

    To "split" values inside arrays, index-related context matchers are available, e.g.

    Splitter.json("foo" \ anyIndex).as[Int].parseToList

    can be used to capture each of the numbers in the "foo" array in

    {
      "foo": [1, 2, 3]
    }

    A note about JsonEvents in spac: JSON doesn't have any explicit markers for when a field ends, or when an array index starts or ends; those context changes are essentially inferred by the presence of some other event. For example, instead of a "field end" event, typically there will be either a new "field start" or a token representing the end of the current object. With spac, splitters and context matchers generally operate under the assumption that a "stack push" event (like a field start) will eventually be followed by a corresponding "stack pop" event (i.e. field end).

    To allow for this, these "inferred" events (FieldEnd, IndexStart, IndexEnd) are explicitly represented as JsonEvents in the stream being parsed. Keep this in mind when creating JSON ContextMatchers:

    • field-related matchers will match a stack like case ObjectStart :: FieldStart(_) :: _
    • index-related matchers will match a stack like case ArrayStart :: IndexStart(_) :: _
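
The note above can be illustrated with a minimal, self-contained sketch. The event types below are simplified stand-ins invented for this example (they are not spac's actual JsonEvent classes), and `valuePaths` is a hypothetical helper. The point is only that once FieldEnd, IndexStart, and IndexEnd appear explicitly in the stream, tracking the context stack reduces to a plain push/pop fold:

```scala
object InferredJsonEventsSketch {
  // Simplified stand-ins for the JSON events described above.
  // These are NOT spac's real JsonEvent classes, just enough to show the idea.
  sealed trait Ev
  case object ObjectStart extends Ev
  case object ObjectEnd extends Ev
  case object ArrayStart extends Ev
  case object ArrayEnd extends Ev
  final case class FieldStart(name: String) extends Ev
  case object FieldEnd extends Ev
  final case class IndexStart(index: Int) extends Ev
  case object IndexEnd extends Ev
  final case class Value(raw: String) extends Ev

  // Events for {"foo": [1, 2]} with the inferred events written out explicitly.
  val sample: List[Ev] = List(
    ObjectStart,
    FieldStart("foo"),
    ArrayStart,
    IndexStart(0), Value("1"), IndexEnd,
    IndexStart(1), Value("2"), IndexEnd,
    ArrayEnd,
    FieldEnd,
    ObjectEnd
  )

  /** Pairs every value with the context stack (outermost first) active when it appeared. */
  def valuePaths(events: List[Ev]): List[(List[Ev], String)] = {
    val start = (List.empty[Ev], List.empty[(List[Ev], String)])
    val (_, found) = events.foldLeft(start) {
      case ((stack, acc), ev) =>
        ev match {
          // every "push" event is guaranteed a matching "pop" event later on
          case ObjectStart | ArrayStart | FieldStart(_) | IndexStart(_) => (ev :: stack, acc)
          case ObjectEnd | ArrayEnd | FieldEnd | IndexEnd => (stack.drop(1), acc)
          case Value(raw) => (stack, (stack.reverse, raw) :: acc)
        }
    }
    found.reverse
  }
}
```

In this sketch, the stack recorded for the first array element starts with ObjectStart :: FieldStart("foo") and continues with ArrayStart :: IndexStart(0), which is the shape the field-related and index-related matchers are described as matching.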
    Definition Classes
    spac
  • package xml

    This package provides extensions to the core "spac" library which allow for the handling of XML data.

    Rather than creating explicit classes that extend Parser, Transformer, and Splitter, this package provides type aliases and implicit extensions. For example, XmlParser[A] is just a type alias for Parser[XmlEvent, A], and XmlParser is just a call to Parser[XmlEvent].

    Three main Parser methods are added to Parser[XmlEvent] via the XmlParserApplyOps implicit class:

    • XmlParser.forText - for capturing raw text
    • XmlParser.attr - for capturing mandatory attributes from elements
    • XmlParser.attrOpt - for capturing optional attributes from elements

    One main Splitter constructor method is added to Splitter via the XmlSplitterApplyOps implicit class:

    • Splitter.xml - for creating splitters based on an inspection of an "element stack"

    Three main Splitter member methods are added to Splitter[XmlEvent, C] via the XmlSplitterOps implicit class:

    • .attr - alias for .joinBy(XmlParser.attr(...))
    • .attrOpt - alias for .joinBy(XmlParser.attrOpt(...))
    • .text - alias for .joinBy(XmlParser.forText)

    A DSL for creating xml-specific ContextMatchers is provided to make it more convenient to call Splitter.xml. For example:

    Splitter.xml("things" \ "thing").attr("foo").parseToList

    can be used to capture a list of the "foo" attributes in the <thing> elements in

    <things>
       <thing foo="hello" />
       <thing foo="Goodbye">
          <extra>junk</extra>
       </thing>
    </things>
    Definition Classes
    spac
  • CallerPos
  • ContextChange
  • ContextLocation
  • ContextMatcher
  • ContextPop
  • ContextPush
  • ContextTrace
  • HasLocation
  • LowPriorityTypeReduceImplicits
  • Parser
  • ParserApplyWithBoundInput
  • Signal
  • SingleItemContextMatcher
  • Source
  • SpacException
  • SpacTraceElement
  • Splitter
  • SplitterApplyWithBoundInput
  • StackInterpretation
  • StackLike
  • Transformer
  • TransformerApplyWithBoundInput
  • TypeReduce
  • Unconsable

trait Splitter[In, +C] extends AnyRef

Primary "spac" abstraction that acts as a selector for sub-streams within a single input stream.

A "sub-stream" is some series of consecutive values from the original stream, identified by a "context" value. Sub-streams do not overlap with each other.

For example, when handling a stream of XML events, you might want to create a Splitter that identifies the events representing elements at a specific location within the XML, something like an XPath expression that operates on streams. When using xml-spac, you might construct a splitter like Splitter.xml("rootElem" \ "things" \ "thing"). This would identify a new sub-stream for each <thing> element that appears inside a <things> element, inside the <rootElem> element. An example sub-stream for a <thing> element might be ElemStart("thing"), Text("hello"), ElemEnd("thing").

A Splitter's general goal is to attach a Parser or Transformer to each sub-stream, passing the contents of that sub-stream through the attached Parser or Transformer in order to get an interpretation of that sub-stream (i.e. the Parser's result, or some emitted outputs from a Transformer). With the <thing> example above, you might attach a parser that concatenates the content of all Text events it sees, i.e. XmlParser.forText. Since a separate parser will run for each sub-stream, the result is something like "a stream of Strings, each representing the concatenated text from an individual <thing> element".
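
The idea of sub-streams can be sketched without the library. The following is a minimal, self-contained model, not spac's implementation: the `XmlEv` types and the `textAt` helper are hypothetical names made up for this example. It walks the events once, treats each element at the given path as one sub-stream, and concatenates the text seen inside it, roughly what attaching XmlParser.forText to each <thing> sub-stream is described as doing.

```scala
object SubStreamSketch {
  // Simplified stand-ins for XML events; not the types provided by xml-spac.
  sealed trait XmlEv
  final case class ElemStart(name: String) extends XmlEv
  final case class Text(value: String) extends XmlEv
  final case class ElemEnd(name: String) extends XmlEv

  /** Returns the concatenated text of every element located exactly at `path` (outermost first). */
  def textAt(path: List[String], events: Iterator[XmlEv]): List[String] = {
    val target = path.reverse                    // the stack below is kept innermost-first
    var stack = List.empty[String]               // names of the currently open elements
    var current: Option[StringBuilder] = None    // Some(_) while inside a matched sub-stream
    val results = List.newBuilder[String]
    events.foreach {
      case ElemStart(name) =>
        stack = name :: stack
        if (stack == target) current = Some(new StringBuilder) // a new sub-stream begins
      case Text(value) =>
        current.foreach(_.append(value))         // only text inside a sub-stream is kept
      case ElemEnd(_) =>
        if (stack == target) {                   // the sub-stream ends: emit its result
          current.foreach(sb => results += sb.toString)
          current = None
        }
        stack = stack.drop(1)
    }
    results.result()
  }

  val sample: List[XmlEv] = List(
    ElemStart("rootElem"), ElemStart("things"),
    ElemStart("thing"), Text("hello"), ElemEnd("thing"),
    ElemStart("thing"), Text("good"), ElemStart("b"), Text("bye"), ElemEnd("b"), ElemEnd("thing"),
    ElemEnd("things"), ElemEnd("rootElem")
  )
}
```

Calling `SubStreamSketch.textAt(List("rootElem", "things", "thing"), SubStreamSketch.sample.iterator)` yields one String per <thing> element. Only one sub-stream's text is held in memory at a time, which is the property that makes this style suitable for large documents.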

In

Data event type for the input stream

C

Context type used to identify each sub-stream

Source
Splitter.scala
Linear Supertypes
AnyRef, Any

Abstract Value Members

  1. abstract def addBoundaries: Transformer[In, Either[ContextChange[In, C], In]]

    Inject "boundary" events into an input stream, where a ContextPush represents the beginning of a new sub-stream, and a ContextPop represents the end of a sub-stream.

    returns

    A transformer that injects the boundary events into any given input stream
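
As a rough illustration of what "injecting boundary events" means, here is a self-contained sketch over a toy event type. `Ev`, `Boundary`, `Push`, `Pop`, and this standalone `addBoundaries` function are assumptions made for the example. They mirror the Either-based output shape of the signature above but are not the library's ContextPush and ContextPop classes. The sketch also assumes the push is emitted just before the event that opens a sub-stream and the pop just after the event that closes it.

```scala
object BoundarySketch {
  // Toy input events; hypothetical, not part of spac.
  sealed trait Ev
  final case class Open(name: String) extends Ev
  final case class Close(name: String) extends Ev
  final case class Data(value: String) extends Ev

  // Toy boundary markers standing in for ContextPush / ContextPop.
  sealed trait Boundary
  final case class Push(context: String) extends Boundary
  case object Pop extends Boundary

  /** Wraps each input in Right, adding Left(Push)/Left(Pop) around every selected element. */
  def addBoundaries(events: List[Ev])(select: String => Boolean): List[Either[Boundary, Ev]] = {
    val out = List.newBuilder[Either[Boundary, Ev]]
    var depth = 0 // 0 = outside any sub-stream, otherwise the nesting depth inside the current one
    events.foreach { ev =>
      ev match {
        case Open(name) if depth == 0 && select(name) =>
          out += Left(Push(name)) // the sub-stream starts here, using the name as its context
          out += Right(ev)
          depth = 1
        case Open(_) if depth > 0 =>
          out += Right(ev)
          depth += 1
        case Close(_) if depth > 0 =>
          out += Right(ev)
          depth -= 1
          if (depth == 0) out += Left(Pop) // the sub-stream ends after its closing event
        case _ =>
          out += Right(ev)
      }
    }
    out.result()
  }

  val sample: List[Ev] =
    List(Open("root"), Open("thing"), Data("hi"), Close("thing"), Close("root"))
}
```

Because sub-streams do not overlap, a single depth counter is enough in this sketch to tell when the current sub-stream has ended.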

Concrete Value Members

  1. final def !=(arg0: Any): Boolean
    Definition Classes
    AnyRef → Any
  2. final def ##(): Int
    Definition Classes
    AnyRef → Any
  3. def +(other: String): String
    Implicit
    This member is added by an implicit conversion from Splitter[In, C] to any2stringadd[Splitter[In, C]] performed by method any2stringadd in scala.Predef.
    Definition Classes
    any2stringadd
  4. def ->[B](y: B): (Splitter[In, C], B)
    Implicit
    This member is added by an implicit conversion from Splitter[In, C] to ArrowAssoc[Splitter[In, C]] performed by method ArrowAssoc in scala.Predef.
    Definition Classes
    ArrowAssoc
    Annotations
    @inline()
  5. final def ==(arg0: Any): Boolean
    Definition Classes
    AnyRef → Any
  6. def as[Out](implicit parser: Parser[In, Out]): Transformer[In, Out]

    Like joinBy, but the parser is passed implicitly

  7. final def asInstanceOf[T0]: T0
    Definition Classes
    Any
  8. def clone(): AnyRef
    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws(classOf[java.lang.CloneNotSupportedException]) @native()
  9. def ensuring(cond: (Splitter[In, C]) => Boolean, msg: => Any): Splitter[In, C]
    Implicit
    This member is added by an implicit conversion from Splitter[In, C] to Ensuring[Splitter[In, C]] performed by method Ensuring in scala.Predef.
    Definition Classes
    Ensuring
  10. def ensuring(cond: (Splitter[In, C]) => Boolean): Splitter[In, C]
    Implicit
    This member is added by an implicit conversion from Splitter[In, C] to Ensuring[Splitter[In, C]] performed by method Ensuring in scala.Predef.
    Definition Classes
    Ensuring
  11. def ensuring(cond: Boolean, msg: => Any): Splitter[In, C]
    Implicit
    This member is added by an implicit conversion from Splitter[In, C] to Ensuring[Splitter[In, C]] performed by method Ensuring in scala.Predef.
    Definition Classes
    Ensuring
  12. def ensuring(cond: Boolean): Splitter[In, C]
    Implicit
    This member is added by an implicit conversion from Splitter[In, C] to Ensuring[Splitter[In, C]] performed by method Ensuring in scala.Predef.
    Definition Classes
    Ensuring
  13. final def eq(arg0: AnyRef): Boolean
    Definition Classes
    AnyRef
  14. def equals(arg0: AnyRef): Boolean
    Definition Classes
    AnyRef → Any
  15. def finalize(): Unit
    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws(classOf[java.lang.Throwable])
  16. def flatMap[Out](transformMatches: (ContextPush[In, C]) => Transformer[In, Out]): Transformer[In, Out]

    Creates a new transformer by attaching an "inner" transformer to each sub-stream based on the sub-stream context. For each sub-stream, a new transformer will be created, and the inputs from the sub-stream will be piped into the inner transformer. Anything that the inner transformer emits will be emitted by the returned transformer.

  17. def formatted(fmtstr: String): String
    Implicit
    This member is added by an implicit conversion from Splitter[In, C] to StringFormat[Splitter[In, C]] performed by method StringFormat in scala.Predef.
    Definition Classes
    StringFormat
    Annotations
    @inline()
  18. final def getClass(): Class[_ <: AnyRef]
    Definition Classes
    AnyRef → Any
    Annotations
    @native()
  19. def hashCode(): Int
    Definition Classes
    AnyRef → Any
    Annotations
    @native()
  20. final def isInstanceOf[T0]: Boolean
    Definition Classes
    Any
  21. def joinBy[Out](parser: Parser[In, Out]): Transformer[In, Out]

    Like map, but when you want to use the same parser for each sub-stream, regardless of the context value

  22. def map[Out](parseMatches: (C) => Parser[In, Out]): Transformer[In, Out]

    Creates a new transformer by attaching a new parser to each sub-stream based on the sub-stream context. For each sub-stream, a new parser will be created, and inputs from the sub-stream will be piped into that parser. When the sub-stream ends, or if the parser finishes on its own, the parser's result will be emitted as an Out event.

    Out

    The parser's output type

    parseMatches

    Given the context for a sub-stream, return a parser to handle that sub-stream

    returns

    A transformer that will emit the result of each parsed sub-stream
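
The relationship between map and joinBy can be shown with a deliberately simplified model. In this sketch the sub-streams are assumed to be already separated and held in memory, and a "parser" is just a function over a sub-stream's inputs. `MiniParser`, `mapEach`, and `joinByAll` are hypothetical names for illustration only, and the real Splitter methods return a streaming Transformer rather than a List.

```scala
object MapSketch {
  // Hypothetical stand-in for a parser: consumes one sub-stream's inputs and yields a result.
  type MiniParser[In, Out] = List[In] => Out

  /** Like map: choose a parser per sub-stream from its context, then emit each parser's result. */
  def mapEach[In, C, Out](subStreams: List[(C, List[In])])(
    parseMatches: C => MiniParser[In, Out]
  ): List[Out] =
    subStreams.map { case (context, inputs) => parseMatches(context)(inputs) }

  /** Like joinBy: the same parser for every sub-stream, ignoring the context value. */
  def joinByAll[In, C, Out](subStreams: List[(C, List[In])])(
    parser: MiniParser[In, Out]
  ): List[Out] =
    mapEach[In, C, Out](subStreams)(_ => parser)

  val sum: MiniParser[Int, Int] = _.sum
  val max: MiniParser[Int, Int] = _.max

  // Two sub-streams, each identified by a String context.
  val subStreams: List[(String, List[Int])] =
    List("sum" -> List(1, 2, 3), "max" -> List(4, 9, 2))
}
```

With these definitions, `mapEach(subStreams)(ctx => if (ctx == "sum") sum else max)` picks a different parser per context, while `joinByAll(subStreams)(sum)` applies the same one everywhere.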

  23. def mapTraced[Out](parseMatches: (ContextPush[In, C]) => Parser[In, Out]): Transformer[In, Out]

    Like map, but using the ContextPush associated with the sub-stream, instead of just the context value itself.

  24. final def ne(arg0: AnyRef): Boolean
    Definition Classes
    AnyRef
  25. final def notify(): Unit
    Definition Classes
    AnyRef
    Annotations
    @native()
  26. final def notifyAll(): Unit
    Definition Classes
    AnyRef
    Annotations
    @native()
  27. final def synchronized[T0](arg0: => T0): T0
    Definition Classes
    AnyRef
  28. def toString(): String
    Definition Classes
    AnyRef → Any
  29. final def wait(): Unit
    Definition Classes
    AnyRef
    Annotations
    @throws(classOf[java.lang.InterruptedException])
  30. final def wait(arg0: Long, arg1: Int): Unit
    Definition Classes
    AnyRef
    Annotations
    @throws(classOf[java.lang.InterruptedException])
  31. final def wait(arg0: Long): Unit
    Definition Classes
    AnyRef
    Annotations
    @throws(classOf[java.lang.InterruptedException]) @native()

Deprecated Value Members

  1. def →[B](y: B): (Splitter[In, C], B)
    Implicit
    This member is added by an implicit conversion from Splitter[In, C] to ArrowAssoc[Splitter[In, C]] performed by method ArrowAssoc in scala.Predef.
    Definition Classes
    ArrowAssoc
    Annotations
    @deprecated
    Deprecated

    (Since version 2.13.0) Use -> instead. If you still wish to display it as one character, consider using a font with programming ligatures such as Fira Code.
