Advanced Users For KNIME Analytics Platform
Advanced Users For KNIME Analytics Platform
• Sorts a table!
• Choice of ascending or descending
• Sort by multiple columns
• Similar to Metanodes
• Differ in key areas:
– Local variable scope (global scope for
Metanodes)
– Configurable via Quick Form nodes
• Key to advanced functionality in KNIME
products:
– Use for new WebPortal pages
• Right-click or double-click a
Wrapped Metanode to
configure
• Use in WebPortal
Date Time
1 Month
To calculate difference to
second column, both columns
need to have the same type!
Select granularity
• Input: Date&Time
– Set time zone
• Input: Date&Time (Time zone)
– Set time zone
– Shift time zone
– Remove time zone
• Loops
– Iterate over a workflow snippet with variable
inputs.
• Switches
– Direct the path of a workflow by selectively
executing one or more workflow branches.
• Try-Catch
– Handle workflow branches that may fail in
execution and you don‘t know before
execution
• A loop block is defined by appropriate loop start and loop end nodes.
• Loop body = Nodes in between and side branches.
Loop body
Loop
end
Loop start node
node
(Hint: Group Loop Start creates a flow variable naming the current group)
Active
Inactive
Loops
– The Loop Start is connected to the Loop End node, they form a pair
Switches
– A Switch Start can be used without a corresponding Switch End. They
can also be combined.
• Standard execution: Node by node. Node processes all data, finishes, then
passes data to next node, etc.
• Streaming: Nodes executed concurrently, each nodes passes data to the
next as soon as it is available, i.e. before node is fully executed
– Faster execution, esp. for reading/preprocessing data
• Create wrapped metanode -> Configure -> Job Manager Selection ->
Simple Streaming
– Not available for all nodes (show in node repository)
– Can only execute entire metanode, not individual nodes
– Intermediate results not available since nothing is cached
…
crowd”
5 2 2 7 7 6
2 9 6 7 6 8 9 3 3 9 5 7
• Pick a different random subset of the training data for each model in the
ensemble (bag).
1 4 1
5 2 5 7
… 7 6
2 9 6 7 2 8 9 3 3 9 5 7
• Allows testing the model using the training data: when validating, each
model should only vote on data points that were not used to train it
X1 X2
1 4 1 1 4 1
5 2 2 7 … 7 6 5 2 2 7 … 7 6
2 9 6 7 6 8 9 3 3 9 5 7 2 9 6 7 6 8 9 3 3 9 5 7
P1 P2
… Pn P1 P2
… Pn
y1OOB y2OOB
5 2
faster 2 9 6 7
• X-Partitioner → X-Aggregator
• X-Partitioner replaces Partition
• X-Aggregator replaces Scorer
• Can be used with any learner/predictor
Scoring
Starting point:
create local H2O
context
Database type
defines SQL dialect
Columns to update