fix #6 #7 #8

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Open

antgonza wants to merge 5 commits into qiita-spots:main from antgonza:fix-6-7

Member

antgonza commented Nov 24, 2025

No description provided.

fix

c8f4e9c

antgonza mentioned this pull request

Create a MANIFEST.txt file, listing all the files in the folder. #7

Open

antgonza added 3 commits

November 24, 2025 07:08


          files.sort()

2b0b8bb


          debug

d5e53c9


          avoid duplicated links

b1abc8f

sjanssen2 reviewed

View reviewed changes

sjanssen2 left a comment

I generally like the idea and the cleanup of the existing test. I would love to see instructions on what the user can do with the information provided in the MANIFEST, i.e. how to obtain these files.

README.rst

Comment on lines +9 to +12

    
              - `summary.html`: a browser friendly file listing that will include all files at `[artifact-id]/[output-folder]` and

                any `index.html` files in any subfolder. As a reminder, the Qiita nginx basic configuration allows to display/load any

                html/JS available files; thus, able to display properly `index.html` files available

              - `MANIFEST.txt`: a comprehensive list of all available files in the folder.

sjanssen2 Nov 26, 2025

from the wording, it is not 100% clear to me if

"will include all files at [artifact-id]/[output-folder]" includes recursion through all sub-directories
"Qiita nginx basic configuration allows to display/load any html/JS available files" which lines in the example configuration refers to this? Can you point to those?
which class of user can make use of the MANIFEST.txt. Can he/she download the whole directory as a ZIP archive?

README.rst

    
              The two main plugins using this output are:

              - https://github.com/qiita-spots/qp-knight-lab-processing: which will generate an `[output-folder]` contaning all the logs,

                files and summaries from BCL to clean FASTQ processing. Note that multiqc resoults are part of this and the outputs are

sjanssen2 Nov 26, 2025

typo

sjanssen2 Nov 26, 2025

Can you add which user class can run this command? I figure it is only admins but no normal users, right?

qtp_job_output_folder/tests/test_summary.py Outdated

Comment on lines 9 to 15

    
              from unittest import main

              from tempfile import mkdtemp

              from os import remove

              from os.path import exists, isdir, join, dirname, abspath

              from inspect import currentframe, getfile

              from shutil import rmtree, copytree

              from json import dumps

              from os import remove

              from os.path import abspath, dirname, exists, isdir, join

              from shutil import copytree, rmtree

              from tempfile import mkdtemp

              from unittest import main

sjanssen2 Nov 26, 2025

hm, your editor seems to have flipped ordering of imports. Makes it harder to see actual differences :-/

qtp_job_output_folder/tests/test_summary.py

Comment on lines 73 to 75

    
                          self._clean_up_files.extend([ff["filepath"]])

                          for f in res["files"].values()

                          for ff in f

sjanssen2 Nov 26, 2025

my intuition says that shorter variable names are inner elements. Here f is a list and ff a file-dict.
With three lines, we have more space, therefore I suggest to change f into files and ff into file

qtp_job_output_folder/tests/test_summary.py Outdated

Comment on lines 81 to 87

    
                      print("-------------")

                      print("-------------")

                      print(html)

                      print("-------------")

                      print(EXP_HTML.format(aid=aid))

                      print("-------------")

                      print("-------------")

sjanssen2 Nov 26, 2025

leftover from debugging?

qtp_job_output_folder/summary.py

    
                  separator = "|--"

                  for dpath, _, files in walk(folder):

                      # assuring same order, mainly for testing

                      files.sort()

sjanssen2 Nov 26, 2025

is this inplace sorting?

qtp_job_output_folder/summary.py

    
                              index.append(("file", f"{dpath}/{f}"))

                      # if we are not at the top, we should only add

                      # the index.html files

                      elif "index.html" in files:

sjanssen2 Nov 26, 2025

do we have to take care for different case spelling, e.g. a file with name Index.html?

qtp_job_output_folder/summary.py

    
                      elif "index.html" in files:

                          index.append(("file", f"{dpath}/index.html"))

                      depth = dpath.replace(folder, "").count(sep)

sjanssen2 Nov 26, 2025

is folder relative or absolute? What if the path contains the folder infix twice like job-output/4/job-output/4/subdir?

qtp_job_output_folder/summary.py

    
                      with open(manifest_fp, "w") as of:

                          of.write("\n".join(manifest))

                      links = [link % (manifest_fp[tlink:], "file", manifest_fp[tname:])]

sjanssen2 Nov 26, 2025

would https://www.geeksforgeeks.org/python/python-os-path-relpath-method/ be a safer solution? Instead of performing string operations here?

qtp_job_output_folder/summary.py Outdated

Comment on lines 58 to 62

    
                      for ft, f in index:

                          # to avoid any duplication of lines:

                          _link = link % (f[tlink:], ft, f[tname:])

                          if _link not in links:

                              links.append(_link)

sjanssen2 Nov 26, 2025

wouldn't a sorted(list(set())) do?

sjanssen2 suggested changes

View reviewed changes

sjanssen2 left a comment

When looking closer at the currently failing test, I see your debug info:

-------------
<a href="./10/test_data/MANIFEST.txt" type="file" target="_blank">test_data/MANIFEST.txt</a><br/>
<a href="./10/test_data/file_1" type="file" target="_blank">test_data/file_1</a><br/>
<a href="./10/test_data/file_2" type="file" target="_blank">test_data/file_2</a><br/>
<a href="./10/test_data/folder_a/folder_b/index.html" type="file" target="_blank">test_data/folder_a/folder_b/index.html</a><br/>
<a href="./10/test_data/folder_1/index.html" type="file" target="_blank">test_data/folder_1/index.html</a><br/>
<a href="./10/test_data/test_data/folder_a/folder_b/index.html" type="file" target="_blank">test_data/test_data/folder_a/folder_b/index.html</a><br/>
<a href="./10/test_data/test_data/folder_1/index.html" type="file" target="_blank">test_data/test_data/folder_1/index.html</a>
-------------
<a href="./10/test_data/MANIFEST.txt" type="file" target="_blank">test_data/MANIFEST.txt</a><br/>
<a href="./10/test_data/file_1" type="file" target="_blank">test_data/file_1</a><br/>
<a href="./10/test_data/file_2" type="file" target="_blank">test_data/file_2</a><br/>
<a href="./10/test_data/folder_a/folder_b/index.html" type="file" target="_blank">test_data/folder_a/folder_b/index.html</a><br/>
<a href="./10/test_data/folder_1/index.html" type="file" target="_blank">test_data/folder_1/index.html</a>
-------------

and assume the above is the observation, below is expectation.

Why do you observe ./10/test_data/test_data/folder_1/index.html? The test_data part should not be repeated?! As the provided test_data does NOT contain itself.


          pull main

f7bf36a

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet