Here we describe four alternative configurations for the SigWin-detector workflow. Configurations Basic1 and Basic2 are used in the case we want to compute one single SiqWin-map for the entire input data sequence. Configurations Sub1 and Sub2 are used in the case that the input data sequence can be subdivided into logical subsequences and we want to present the output in terms of those subsequences, thus one SigWin-map per subsequence. For instance, a transcriptome map has chromosomes as logical subsequences; a time series may have years, months, days, etc, as logical subsequences. Configurations ending in 1 only compute one significant window type: windows with significantly high median values or windows with significantly low median value. Configurations ending in 2 compute both significant window types and plot them with different colors.
SigWin-detector
Config-Basic: Detects significant windows in a sequence.
SigWin-detector
Config-Sub: Detects significant windows in a subsequence.
1 A file containing the detected significant windows for each label. Each data row represents a stretch of consecutive significant windows. Column 1 gives the window size and columns 2 and 3 give the first and last significant windows in the stretch. Output file example.
2 An XMGRACE configurations file with information on how to plot the resulting SigWin-map (Use: xmgrace -bat file.agrcmd)

ColumnReader: file_name, column.
SWMedian: min_window_size, max_window_size, step_size.
FDRThresold: FDR_level, threshold.
SigWinSelect: Output_file, threshold.
1 A file containing the detected significantly high median windows for each label. Output file example.
2 A file containing the detected significantly low median windows for each label. Output file example.
3 An XMGRACE configurations file with information on how to plot the SigWin-maps. (Use: xmgrace -bat file.agrcmd)

ColumnReader: file_name, column.
SWMedian: min_window_size, max_window_size, step_size.
FDRThresold (upper): FDR_level, threshold=high. FDRThresold (lower): FDR_level, threshold=low.
SigWinSelect (upper): Output_file, threshold=high. SigWinSelect (lower): Output_file, threshold=low.
StringJoiner: n_streams=2
1 A file containing the detected significant windows for each label. Output file example.
2 An XMGRACE configuration file with information on how to plot the resulting SigWin-maps (Use: xmgrace -bat file.agrcmd)

Read2Clumns: file_name, column1, column2.
SWMedian: min_window_size, max_window_size, step_size.
FDRThresold: FDR_level, threshold.
SigWinSelect: Output_file, threshold.
1 A file containing the detected significantly high median windows for each label. Output file example.
2 A file containing the detected significantly low median windows for each label. Output file example.
3 An XMGRACE configurations file with information on how to plot the SigWin-maps. (Use: xmgrace -bat file.agrcmd)

Read2Clumns: file_name, column1, column2.
SWMedian: min_window_size, max_window_size, step_size.
FDRThresold (upper): FDR_level, threshold=high. FDRThresold (upper): FDR_level, threshold=low.
SigWinSelect (lower): Output_file, threshold=high. SigWinSelect (lower): Output_file, threshold=low.
StringJoiner: n_streams=2
A short description of each module containing the modules functionality, most used parameters, input ports and output ports is given below. Only the parameters that must be set before execution are listed (the other parameters may keep their default value). The port connections are given in the workflow diagrams. The contents of the input ports are the same as the contents of the corresponding output ports. The ports are named by an abbreviation of the module name followed by 'i' or 'o' (input or output respectively) and the port number. Input ports are colored in blue and output ports in red. The ports are numbered in the same order as they appear in the workflow diagrams. A detailed description of the modules is given here.
Module
functionality |
Parameters |
Input ports |
Output ports
description |
|
ColumnReader: Reads the input sequence E= {E1, E2, …, EN}from a tab delimited file and transfers it to the output port. |
file_name: Input file name column: Column number |
CRi1 (Not used) |
CRo1: A vector containing the
input sequence E |
|
Read2Columns: Reads two columns from a tab delimited file and transfers it to the output ports. Column A: a sequence of labels L= {L1, L2, …, LN}. Column B: the input sequence E= {E1, E2, …, EN}. |
file_name: Input file name column1: Column A number column2: Column B number |
R2Ci1 (Not used) |
R2Co2: A vector containing the
labels L. R2Co1: A vector containing the
input sequence E |
|
SeqSplitter: Computes labeled subintervals from L. A labeled subinterval is determined by consecutive labels with the same value. E.g., L={a, a, b, b, b} produces two subintervals I1= (a, 1, 2) and I2=(b, 3, 5), defined by the triple (label, start, end). |
- |
SSPi1 |
SSPo1: A vector containing the
labeled subintervals I={I1, I2, …}. |
|
Rank: Computes the ranks R= {R1, R2, …, RN} corresponding to E. |
– |
Ri1 |
Ro1: The Rank structure
corresponding to E. Ro2: A vector containing R, a sorted version of E. Ro3: A vector containing a sorted version of the non
duplicate values of E. |
|
SWMedian: Computes mS(w), the moving medians of E, for window sizes S = Smin, Smin+DS,
…, Smax =Smin+qDS. |
min_window_size: Smin (odd) max_window_size: Approximate value for Smax step_size: DS (even) |
SWMi1 |
SWMo1: The
parameters SW=(N, Smin, Smax, DS) corresponding to the
sliding window structure. SWMo2: A
sliding window structure containing the computed moving medians (i.e., a
sequence of vectors. Each containing mS(w), for S = Smin, Smin+DS,
…, Smaz). |
|
SWSplitter: Splits the input sliding window structure.
Each new substructure corresponds to a labeled subinterval. E.g., sliding
window SW=(N=30, Smin,=1, Smax =15, DS=2) and intervals I1=
(a, 1, 12) and I2=(b,
13, 30) produce sliding window substructures SW1=(12, 1, 11, 2)
and SW1=(18, 1, 15, 2). |
- |
SWSPi1
|
SWSPo1: A
vector containing the parameters for each sliding window substructure SW={SW1, SW2,
…}. SWSPo2: The
corresponding sliding window substructures |
|
SWMedianProb: Computes fS(m), the
exact theoretical null hypothesis probability density function corresponding
to the moving medians mS(w). |
|
SWMPi1
|
SWMPo1: A
sequence of vectors. Each containing fS(m), |
|
Sample2Freq: Generates gS(m), the
normalized frequency counts corresponding to the moving medians mS(w). |
|
S2Fi2
S2Fi1 |
S2Fo1: A
sequence of vectors. Each containing gS(m), |
|
FDRThreshold: Uses gS(m) and fS(m) to
compute mk,S (or mj,S), the high (or low)
mmFDR thresholds at a given level α. corresponding to each window size S, for S = Smin, Smin+DS,
…, Smin+qDS. |
FDR_level: α (0 to 1) Threshold: high or low |
FDRTi1
|
FDRTo1: A
sequence of high (or low) mmFDR thresholds mk,S (or mj,S),
|
|
SigWinSelect: Selects the windows for which the median
value mS(w) is above (or below) the FDR
threshold mk,S (or mj,S). The resulting
significant windows are written to a tab-delimited file. |
Output_file: output file name. Threshold: high or low |
SWSi1 |
SWSo1: Name
of the file to which the resulting significant windows were written. SWSo1: (Not
used) |
|
StringJoiner: Concatenate up to four strings and send
then to the output port. In the SigWin-detector config-HT workflow: the
strings are the names of the files containing the detected significantly high
median windows and the significantly low median windows respectively. |
n_streams: number of input ports to use (1 to 4). |
SJi1 SJi3 (Not used) SJi4(Not used) |
SJo1: The
concatenated string. |
|
SigWinPlotGrace Generates an XMGRACE [13] configuration file with instructions of how to plot the
resulting SigWin-map. |
|
SWPGi1 |
SWPGo1: A
file containing XMGRACE instructions on how to print |
|
XmGrace Displays the resulting SigWin-map using XMGRACE. |
|
XMGi1 |
– |
#size=26740 ncols=2 descr=hg18-htm
#chrom expression
1 10.0
1 13.0
1 286.0
1 0
...
1 46.0
2 37.0
2 5.0
2 77.0
...
24 7.0
24 74.0
24 96.0
24 1.0
#Windows beneath FDRThreshold for hg18-htm:1
#printing a point at wMin to get correct number of sets
1 1 1
#windowSize first last
25 210 221
25 994 996
27 212 220
...
#Windows beneath FDRThreshold for hg18-htm:2
#printing a point at wMin to get correct number of sets
1 1 1
#windowSize first last
29 18 21
47 30 31
49 29 32
51 29 32
...