Package passert contains verification transformations for testing pipelines. The transformations are not tied to any particular runner, i.e., they can notably be used for remote execution runners, such as Dataflow.
Diff splits 2 incoming PCollections into 3: left only, both, right only. Duplicates are preserved, so a value may appear multiple times and in multiple collections. Coder equality is used to determine equality. Should only be used for small collections, because all values are held in memory at the same time.
Empty asserts that col is empty.
Equals verifies the given collection has the same values as the given values, under coder equality. The values can be provided as single PCollection.
False asserts that the given predicate does not satisfy any element in the condition.
Hash validates that the incoming PCollection<string> has the given size and base64-encoded MD5 hash code. It buffers the entire PCollection in memory and sorts it for determinism.
Sum validates that the incoming PCollection<int> is a singleton with the given value. Specialized version of Equals that avoids a lot of machinery for testing.
True asserts that all elements satisfy the given predicate.