Skip to content

Commit bda13f8

Browse files
Merge pull request #3252 from antgonza/2023.02-review
2023.02 review
2 parents e1ff4dc + 628faea commit bda13f8

File tree

12 files changed

+79
-19
lines changed

12 files changed

+79
-19
lines changed

.github/workflows/qiita-ci.yml

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -108,9 +108,9 @@ jobs:
108108
- name: Install plugins
109109
shell: bash -l {0}
110110
run: |
111-
wget https://data.qiime2.org/distro/core/qiime2-2019.4-py36-linux-conda.yml
112-
conda env create --quiet -n qtp-biom --file qiime2-2019.4-py36-linux-conda.yml
113-
rm qiime2-2019.4-py36-linux-conda.yml
111+
wget https://data.qiime2.org/distro/core/qiime2-2022.11-py38-linux-conda.yml
112+
conda env create --quiet -n qtp-biom --file qiime2-2022.11-py38-linux-conda.yml
113+
rm qiime2-2022.11-py38-linux-conda.yml
114114
export QIITA_SERVER_CERT=`pwd`/qiita_core/support_files/server.crt
115115
export QIITA_CONFIG_FP=`pwd`/qiita_core/support_files/config_test.cfg
116116
export REDBIOM_HOST="http://localhost:7379"

CHANGELOG.md

Lines changed: 18 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,23 @@
11
# Qiita changelog
22

3+
Version 2023.02
4+
---------------
5+
6+
* Added "Software and Data Licensing" to the Qiita FAQ page.
7+
* External resources can now add new sample metadata categories to a study via
8+
the `/api/v1/study/` endpoint.
9+
* Added preparation-id to the GUI list of artifacts used in an analysis.
10+
* Added automatic lower-casing to INSDC null values [#3246](https://github.com/qiita-spots/qiita/issues/3246).
11+
* ArtifactHandler now returns the file size and full path of the files available in
12+
an Artifact. This change had two consequences: (1) the plugins now can control
13+
their behavior based on the file sizes, and (2) all plugins had to be updated to use this new feature.
14+
* Added [qiita_client.artifact_and_preparation_files](https://github.com/qiita-spots/qiita_client)
15+
to help plugins filter per_sample_FASTQ based on size and ignore small file sizes.
16+
* Added qiime2.2022.11 to the system; which updated these plugins: qp-qiime2, qtp-biom, qtp-diversity, qtp-visualization.
17+
* Added q2-mislabeled to the qiime2.2022.11.
18+
* Added support for Amplicon data to the internal [Sequence Processing Pipeline](https://github.com/qiita-spots/qp-knight-lab-processing).
19+
20+
321
Version 2022.11
422
---------------
523

qiita_core/__init__.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -6,4 +6,4 @@
66
# The full license is in the file LICENSE, distributed with this software.
77
# -----------------------------------------------------------------------------
88

9-
__version__ = "2022.11"
9+
__version__ = "2023.02"

qiita_db/__init__.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -27,7 +27,7 @@
2727
from . import user
2828
from . import processing_job
2929

30-
__version__ = "2022.11"
30+
__version__ = "2023.02"
3131

3232
__all__ = ["analysis", "artifact", "archive", "base", "commands",
3333
"environment_manager", "exceptions", "investigation", "logger",

qiita_db/metadata_template/base_metadata_template.py

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -608,7 +608,8 @@ def _clean_validate_template(cls, md_template, study_id,
608608
for c in _ddf.columns:
609609
if set(INSDC_NULL_VALUES) & set(_ddf[c].values):
610610
for s, v in _ddf[c].to_dict().items():
611-
md_template[c][s] = INSDC_NULL_VALUES[v]
611+
if v in INSDC_NULL_VALUES:
612+
md_template[c][s] = INSDC_NULL_VALUES[v]
612613

613614
return md_template
614615

qiita_db/metadata_template/test/test_prep_template.py

Lines changed: 5 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -723,7 +723,10 @@ def test_clean_validate_template_error_duplicate_samples(self):
723723

724724
def test_clean_validate_template(self):
725725
PT = qdb.metadata_template.prep_template.PrepTemplate
726-
obs = PT._clean_validate_template(self.metadata, 2)
726+
# modify input to make sure we hit all cases
727+
md = self.metadata.copy()
728+
md.loc['SKB7.640196']['str_column'] = 'UnSpeciFied'
729+
obs = PT._clean_validate_template(md, 2)
727730
metadata_dict = {
728731
'2.SKB8.640193': {'center_name': 'ANL',
729732
'center_project_name': 'Test Project',
@@ -755,7 +758,7 @@ def test_clean_validate_template(self):
755758
'center_project_name': 'Test Project',
756759
'ebi_submission_accession': None,
757760
'emp_status': 'EMP',
758-
'str_column': 'Value for sample 3',
761+
'str_column': 'not applicable',
759762
'primer': 'GTGCCAGCMGCCGCGGTAA',
760763
'barcode': 'CCTCTGAGAGCT',
761764
'run_prefix': "s_G1_L002_sequences",

qiita_pet/__init__.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -6,4 +6,4 @@
66
# The full license is in the file LICENSE, distributed with this software.
77
# -----------------------------------------------------------------------------
88

9-
__version__ = "2022.11"
9+
__version__ = "2023.02"

qiita_pet/handlers/api_proxy/__init__.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -38,7 +38,7 @@
3838
from .user import (user_jobs_get_req)
3939
from .util import check_access, check_fp
4040

41-
__version__ = "2022.11"
41+
__version__ = "2023.02"
4242

4343
__all__ = ['prep_template_summary_get_req', 'data_types_get_req',
4444
'study_get_req', 'sample_template_filepaths_get_req',

qiita_pet/support_files/doc/source/faq.rst

Lines changed: 29 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -381,6 +381,35 @@ downloaded from `qiita.ucsd.edu <https://qiita.ucsd.edu/>` and redbiom, includin
381381
and metadata, are distributed under the BSD 3-Clause License.
382382

383383

384+
Some of the studies have `Qiita-EBI Import` as the PI, why is this?
385+
-------------------------------------------------------------------
386+
387+
These are studies that were downloaded to Qiita via `qebil <https://github.com/ucsd-cmi/qebil>`. If you want
388+
us to add your study, please send us an email.
389+
390+
Now, if you are wondering about the possible "Processing notes", here are their explanation:
391+
392+
- MISSING: One or more of the fastq files for your study were unavailable for download from
393+
EBI/ENA or the downloaded files were found to contain corrupt data and were excluded from our
394+
automatic association and processing. A list of the affected samples and their corresponding
395+
EBI/ENA ftp links can be found in the .MISSING. preparation information files in the Uploads
396+
section of this page. If you would like to attempt to manually download and/or correct the
397+
fastq files, please visit the linked EBI/ENA project page in the Study details and follow our
398+
instructions for manually associating and processing the files.
399+
400+
- TOOMANYREADS: One or more of the fastq files for your study were found to contain more read
401+
files than indicated by the single or paired-end read technology that EBI/ENA indicated was
402+
used for processing the sample. This is most likely the case for studies where index reads
403+
have been included in a separate file as part of the upload, however our automated system
404+
is unable to readily distinguish this. A list of the affected samples and their corresponding
405+
EBI/ENA ftp links can be found in the .TOOMANYREADS. preparation information files in the
406+
Uploads section of this page. If you would like to attempt to have these samples processed,
407+
please visit the linked EBI/ENA project page in the Study details and either a) follow our
408+
instructions for manually associating and processing the files here or b) email us to
409+
indicate that the study should be processed with the assumption that the first file
410+
associated with a samples is an index read file.
411+
412+
384413
How to cite Qiita?
385414
------------------
386415

qiita_pet/support_files/doc/source/processingdata/processing-recommendations.rst

Lines changed: 16 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -81,15 +81,24 @@ wetlab processing and we provide the following host references for your convenie
8181
Note that the command produces up to 6 output artifacts based on the aligner and database selected:
8282

8383
- Alignment Profile: contains the raw alignment file and the no rank classification BIOM table
84-
- Taxonomic Prediction - phylum: contains the phylum level taxonomic predictions BIOM table
85-
- Taxonomic Prediction - genus: contains the genus level taxonomic predictions BIOM table
86-
- Taxonomic Prediction - species: contains the genus level taxonomic predictions BIOM table
8784
- Per genome Predictions: contains the per genome level taxonomic predictions BIOM table
88-
- Per gene Predictions: Only WoLr1, contains the per gene level taxonomic predictions BIOM table
85+
- Per gene Predictions: Only WoLr1 & WoLr2, contains the per gene level taxonomic predictions BIOM table
86+
- KEGG Pathways: Only WoLr2, contains the functional profile
87+
- KEGG Ontology (KO): Only WoLr2, contains the functional profile
88+
- KEGG Enzyme (EZ): Only WoLr2, contains the functional profile
89+
8990

9091
.. note::
91-
Woltka provides easy transformations for the "per gene Prediction table" to generate functional
92-
profiles, `more information <https://github.com/qiyunzhu/woltka/blob/master/doc/wol.md#comparison>`_.
92+
Woltka 0.1.4 only produces per-genome, per-gene and functional profiles as we are moving
93+
to Operational Genomic Units (OGUs), which have higher resolution than taxonomic units
94+
for community ecology, and were shown to deliver stronger biological signals in
95+
downstream analyses. For more information please read: `Phylogeny-Aware Analysis of
96+
Metagenome Community Ecology Based on Matched Reference Genomes while Bypassing
97+
Taxonomy <https://journals.asm.org/doi/10.1128/msystems.00167-22>`_. To work on lower
98+
taxonomic levels (like species or genus) you can follow `these instructions
99+
<https://github.com/qiyunzhu/woltka/blob/master/doc/collapse.md#collapse-to-level>`_ and use
100+
this `lineages.txt <http://ftp.microbio.me/pub/wol2/taxonomy/lineages.txt>`_ file
101+
with your collapse command.
93102

94103
Aligners
95104
^^^^^^^^
@@ -245,7 +254,7 @@ Metatranscriptome processing
245254
Qiita currently has one active Metatranscriptome data analysis pipeline, as follows:
246255

247256
#. Ribosomal read filtering via `SortMeRNA <https://pubmed.ncbi.nlm.nih.gov/23071270/>`_; details below. This produces a `Ribosomal reads` and a `Non-ribosomal reads` artifact/
248-
#. Taxonomic profiling via Woltka; for more information see details above.
257+
#. Sequence profiling via Woltka; for more information see details above.
249258

250259
Sample processing guidelines for metatranscriptomic data
251260
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

0 commit comments

Comments
 (0)