ContentsPrerequisitesPlease Note: Working with MinKNOW Run Data requires the Long-read Data Plasmid Transmission Analysis Module In order for the MinKNOW run data to be processed further in SeqSphere, the setting Trim barcodes: On must be used. MinKNOW stores each run in a directory. A fastq_pass subdirectory contains itself barcodeXY subdirectories with multiple fastq.gz files. The file report_RUNNAME.json contains detail information about the run. Pipeline settingsWhen Oxford Nanopore MinKNOW hard disk is selected as Input Source Type an additional input field is available: Output FASTQ directory. The FASTQ-files for a MinKNOW sample are merged together and stored in a subdirectory of Output FASTQ directory that is named by the run id, e.g. 20241209_1142_MN30839_FBA87915_d4fdb8bb. Therefore this directory must be writeable for the current user. Note that Field Terminator and Field Delimiter cannot be edited when Oxford Nanopore MinKnow hard disk is selected and that the file preview is disabled. Running the pipelineA subdirectory named by the run ID is created in the output FASTQ directory. Merged FASTQ files, the report file and a sequence specification file (sequence_specification.spec) are written to this subdirectory. FASTQ files for a run that already has a subdirectory below the output FASTQ directory will be ignored by the pipeline. Sample SheetA sample sheet is used to assign a sample name to the barcodes. The sample sheet is stored in a file sample_sheet_seqsphere.csv or sample_sheet_seqsphere.xlsx in the ONT run directory. This file can be created manually or by using the sample sheet input window. The sample sheet file must contain at least two columns (with headers). The first column is used for barcodes (e.g. barcode01 ... barcode99), the second column contains the sample name. All other columns are ignored. The column headers can contain any name. Sample Sheet Input WindowIf a ONT run directory does not contain a file sample_sheet_seqsphere.csv or sample_sheet_seqsphere.xlsx and no run subdirectory below the output directory exists a sample sheet input window is opened when the pipeline is started. The window can contain several tabs if the sample data has to be entered for several runs. Enter the Sample names and choose the corresponding barcodes. The sample name may only contain the characters a-z, A-Z, 0-9 and -. After clicking OK in the sample sheet window the file sample_sheet_seqsphere.csv is written into the ONT run directory. The run directory must be writable by the Windows user for this to work. If the sample sheet input window is open and the file sample_sheet_seqsphere.csv in the run directory is written by another computer (e.g. on a network drive) the sample sheet for this run is removed from the window. If there are no more sample sheets to enter for the other directories, the window will close.
Pipeline in Continuous ModeUsing a pipeline in continues mode (see page Pipeline Script for details), further processing of the data generated by MinKnow can be started together with the sequencing.
Procedure DetailsInformation about the MinKNOW run are stored an a Sample's procedure details, e.g. the used basecaller and assembler. Information about the sequencing run can be accessed by clicking the cell with the Sequencing Run ID. |