beam: Files

Command windowed_wordcount

windowed_wordcount counts words in text, and can run over either unbounded or bounded input collections.

This example is the last in a series of four successively more detailed 'word count' examples. First take a look at minimal_wordcount, wordcount, and debugging_wordcount.

Basic concepts, also in the preceeding examples: Reading text files; counting a PCollection; writing to GCS; executing a Pipeline both locally and using a selected runner; defining DoFns; user-defined PTransforms; defining pipeline options.

New Concepts:

1. Unbounded and bounded pipeline input modes
2. Adding timestamps to data
3. Windowing
4. Re-using PTransforms over windowed PCollections
5. Accessing the window of an element

Package Files


Package main imports 13 packages (graph). Updated 2018-06-03. Refresh now. Tools for package owners. This is an inactive package (no imports and no commits in at least two years).