Skip to content

Filter

Filter Table

Filters rows in a TableData object using a pandas query string.

Details

Examples:

  • Area > 100 — keep rows where Area is greater than 100
  • Area > 50 and Circularity > 0.8 — multiple conditions
  • Group == "Control" — match a specific text value
  • Group != "Background" — exclude rows
  • Area > Area.mean() — compare to column statistics
  • label in [1, 2, 5] — match specific values from a list

Uses pandas DataFrame.query() syntax. Column names with spaces need backticks: `Column Name` > 10

Direction Port Type
Input in table
Output out table

Top N

Extracts the top (or bottom) N rows ranked by a numeric column.

Details

Outputs:

  • top_n — the selected N rows
  • rest — all remaining rows not in top_n

Parameters:

  • Rank By Column — numeric column to rank by
  • N — number of rows to select
  • SelectTop (largest) or Bottom (smallest)
Direction Port Type
Input in table
Output top_n table
Output rest table

Properties: N (number of rows), Select


Column Value Split

Splits a table into two outputs based on whether a column's value matches a list of specified values.

Details
  • Values — comma-separated. * anywhere triggers glob matching:
  • Control* — starts with "Control"
  • *treated — ends with "treated"
  • *GFP* — contains "GFP"
  • Entries without * are exact matches

Outputs:

  • matched — rows where the column value matches any entry
  • rest — all other rows
Direction Port Type
Input in table
Output matched table
Output rest table

Properties: ``


Random Sample

Randomly samples N rows from the input table.

Details

If N exceeds the table size, the full table is returned (no error).

Parameters:

  • N — number of rows to draw
  • Seed — random seed for reproducibility; leave at -1 for a different sample each run

Outputs:

  • sampled — the N randomly selected rows
  • rest — all remaining rows not in the sample
Direction Port Type
Input in table
Output sampled table
Output rest table

Properties: N (rows to sample), Random Seed (-1=random)


Drop Duplicates

Removes duplicate rows from a table.

Details

Parameters:

  • Subset Columns — comma-separated columns to consider when checking for duplicates. Leave empty to compare all columns.
  • Keep — which duplicate to keep: first occurrence, last, or none

Outputs:

  • unique — rows after removing duplicates
  • dropped — the removed duplicate rows
Direction Port Type
Input in table
Output unique table
Output dropped table

Properties: Keep