Binary repository manager
4 stars based on
We process all components of your study, including the samples, project description, processed data files, and we submit the raw data files to the Sequence Read Archive SRA on your behalf.
Once you have determined that GEO is an appropriate resource for your data type see categories of data we do and do not acceptdata should be submitted using the GEOarchive spreadsheet format, instructions below.
Alternatively, if your metadata are already in a database, and you can generate and export data in SOFT text format, you may prefer to use SOFT format. GEOarchive has three required components: Details binare gen depository each component are described below. Download metadata spreadsheet template and examples. Metadata refers to descriptive information about the overall study, individual samples, all protocols, and references to processed and raw data file names.
Information binare gen depository supplied by completing all fields of a metadata template spreadsheet. Guidelines on the content of each field are provided within the spreadsheet. Processed data are a required part of GEO submissions.
The final processed data are defined as the data on which the conclusions in the related manuscript are binare gen depository. We do not expect standard alignment files e. When standard alignments are the only processed data available, please write to us to inquire about binare gen depository your data are suitable for submission to GEO. Requirements for processed data files are not fully standardized and will depend on the nature of the experiment:.
The reference assembly used e. A description of the format and content of processed data files should be provided in the metadata spreadsheet data processing fields. Raw data are a required part of GEO submissions. We will submit raw data files to SRA for you. Raw Data File Formats: Files that do not conform to supported format requirements will be deleted from our systems.
Submitters are required to de-multiplex their raw data files prior to submission so that each barcoded sample ends up with a dedicated run file. Reads should not be trimmed. Include barcode information in the "library construction protocol" field of the metadata spreadsheet. We usually expect binare gen depository files per run 4 files per run when sequences and qualities are included in separate files.
Submitters should provide the average insert size of the molecules sequenced binare gen depository linkers, adapters, etc Edited files may not be binare gen depository correctly by SRA. We recommend that submitters provide MD5 checksums for their raw data files. The checksums are used to verify file integrity. Checksums can be calculated using the following methods: Many are available for free download. Individual files can be compressed to speed transfer, but this is not required.
Acceptable compression formats are gzip and bzip2 i. Never compress binary files e. If you have any questions or concerns regarding data transfer, please e-mail us. All standard GEO administration and processing procedures apply to sequence submissions. Studies concerning quantitative gene expression, gene regulation, epigenetics, or other functional genomic studies. If you plan to submit genomic data from human specimens that would not be considered large-scale, it is your responsibility to ensure binare gen depository the submitted information does not compromise participant privacy and is in accord with the original consent in addition to all applicable laws, regulations, and institutional policies.
It is your responsibility to ensure that the submitted information does not compromise participant privacy and is in accord with the original consent in addition to all applicable laws, regulations, and institutional binare gen depository. The sponsor would create a Data Access Request and Use Certification and define use restrictions for use in approving data access requests. Submitting high-throughput sequence data to GEO Assembling your submission Metadata spreadsheet Processed data files Binare gen depository data files Uploading your submission General Information Data provisions, standards and administration Categories of sequence submissions accepted by GEO Guidelines for human genomic data submitted to unrestricted-access binare gen depository Assembling your submission Back to top GEO accepts next generation sequence data that examine quantitative gene expression, gene regulation, epigenomics or other aspects of functional genomics using methods such as RNA-seq, miRNA-seq, ChIP-seq, RIP-seq, HiC-seq, methyl-seq, etc.
Metadata spreadsheet Download metadata spreadsheet template and examples Metadata refers to descriptive information about the overall study, individual samples, all protocols, and references to processed and raw data file names. Processed data files Processed data are a required part of GEO submissions. Requirements for processed data files are not fully standardized and will depend on the nature of the experiment: Expression profiling analysis usually generates quantitative data for features of binare gen depository. Features of interest may be genes, transcripts, exons, miRNA, or some other genetic entity.
Two levels of data are often generated: Either or both of these data types may be supplied as processed data. They may be formatted either as a matrix table or individual files for each sample. ChIP-Seq data might include peak files with quantitative data, tag density files, etc.
Raw data files Raw data are a required part of GEO submissions. Uploading your submission Back to top Before you upload: If you plan to submit more than 1 terabyte of data, you must binare gen depository GEO with a list of files and MD5 checksums before you begin transferring files if your files are compressed, the checksums should be for the compressed files. This will allow us to more quickly remove binare gen depository from the FTP server in order to keep space available for other users.
Failure to notify us or provide checksums may lead to your files being removed from our systems without processing. Do not transfer files unless you are confident that you have a submission that includes all required components raw data files, processed data files and metadata spreadsheet. We do not have the resources to store incomplete submissions. Incomplete submissions will be deleted from our systems.
Transfer the folder using the FTP instructions below. We strongly recommend that submitters compress their raw data files e.
Do not compress with WinZip. Do not tar archive single files. Do not compress binary files e. After you upload, send us an email notification: After file transfer is complete, you must e-mail GEO with the following information: We do not send automated confirmation that files have been received. You should expect to receive an e-mail from a curator within 5 business days after you send us the notification see FAQ. Thorough descriptions of the biological samples under investigation, and procedures to which they were subjected Thorough descriptions of the binare gen depository used to generate and process the data Final processed or summary data from which the conclusions in associated manuscripts are based Original raw data files containing sequence reads and quality scores, which will be uploaded to NCBI's Sequence Read Archive SRA database.
Administration All standard GEO binare gen depository and processing procedures apply to sequence submissions. Unique and stable GEO accession numbers are issued to studies; these accessions can be cited in manuscripts GEO accession numbers are typically issued within 5 business days after completion of submission Data can be held private until publication Reviewers can have anonymous access to private binare gen depository Submitters can update their records at any time More information on these aspects is provided in our FAQ.
Categories of sequence submissions processed by GEO Back to top GEO accepts GEO does not accept Studies concerning quantitative gene expression, gene regulation, epigenetics, or other functional genomic studies. Guidelines for human genomic data submitted to unrestricted-access repositories Binare gen depository to top NIH-funded studies: