You can only upload sequences if you have created an account and are part of a group.
Before you begin this process, you should ensure your data is in the correct format. Each sequence should have a unique submissionID that can be used to associate it with its corresponding metadata entry.
Loculus expects:
fasta
format with a unique submissionID per sequence.tsv
format for each sequence. You can read more about the required metadata fields here. If you need help formatting metadata we offer a metadata template for each organism on the submission page.The files can also be compressed: accepted formats are .zst, .gz, .zip and .xz.
You can try out uploading sequences to our Demo Instance - it works just like the ‘real’ Pathoplexus, but is wiped regularly and no data is sent onward to INSDC. We also have some example data you can upload to the Demo Instance.
Pathoplexus expects multi-segmented pathogens to have one unique submissionID per isolate (pathogen sample containing all segments). However, fasta
files should still have a separate entry/record per segment. Therefore, each record id should include the unique submissionID of the isolate and the segment name, for example: submissionID + '_' + segmentName
. The metadata is uploaded per isolate, i.e. there will be only one row for each submissionID
and segmented metadata parameters need to be uploaded individually, i.e. under length_{segmentName}
etc.
Uploading sequences via the website is a very way to submit sequences without having to worry about any code.
fasta
file with the sequences and a metadata file with the associated metadata into the box on the website, or click the ‘Upload a file’ link within the boxes to open a file-selection boxThe data will now be processed, and you will have to approve your submission before it is finalized. You can see how to do this here.
To use the demo instance instead of the main instance, please replace backend.pathoplexus.org
with backend-demo.pathoplexus.org
.
By using our API you agree to our Data Use Terms.
It is currently possible to upload sequences through an HTTP API. We also plan to release a command-line interface before the end of 2024.
To upload sequences through the HTTP API you will need to:
To upload sequences with the open use terms: https://backend.pathoplexus.org/<organism>/submit?groupId=< group id>&dataUseTermsType=OPEN
To upload sequences with the restricted use terms: https://backend.pathoplexus.org/<organism>/submit?groupId=<group id>&dataUseTermsType=RESTRICTED&restrictedUntil=<restricted-until-date>
The available values for the organism are: cchf
, ebola-sudan
, ebola-zaire
and west-nile
.
The restricted-until date must be provided in the ISO format (e.g., 2024-08-27).
The header should contain
Authorization: Bearer <authentication-token>
Content-Type: multipart/form-data
The request body should contain the FASTA and metadata TSV files with the keys sequenceFile
and metadataFile
With cURL, the corresponding command for sending the POST request can be:
curl -X 'POST' \
'https://backend.pathoplexus.org/<organism>/submit?groupId=<group id>&dataUseTermsType=OPEN' \
-H 'accept: application/json' \
-H 'Authorization: Bearer <authentication token>' \
-H 'Content-Type: multipart/form-data' \
-F 'metadataFile=@<metadata file name>' \
-F 'sequenceFile=@<fasta file name>'
Further information can be found in our Swagger API documentation.
As with the website, data will now be processed, and you will have to approve your submission before it is finalized. You can see how to do this here.