Large export is not re-importable (ImportError: You tried to import too many files, max. is 1000) #48

@moi90

This project consists of many samples (4060 to be exact):
https://ecotaxa.obs-vlfr.fr/prj/6433

When exporting with exp_type=BAK, split_by=S, the archive contains one TSV file per sample, i.e. 4060 individual TSV files.
(Guessing from the UI, split_by should be ignored when doing a BAK export, but I consider this a feature.)

However, when re-importing the same data, I get an import error:

 	You tried to import too many files, max. is 1000

    File "/usr/lib/python3.8/threading.py", line 890, in _bootstrap self._bootstrap_inner()
    File "/usr/lib/python3.8/threading.py", line 932, in _bootstrap_inner self.run()
    File "/app/BG_operations/JobScheduler.py", line 40, in run sce.run_in_background()
    File "/app/API_operations/helpers/JobService.py", line 73, in run_in_background self.do_background()
    File "/app/API_operations/imports/Import.py", line 76, in do_background self.do_validate()
    File "/app/API_operations/imports/Import.py", line 115, in do_validate how, diag, nb_rows = self._collect_existing_and_validate(source_dir_or_zip, loaded_files)
    File "/app/API_operations/imports/Import.py", line 142, in _collect_existing_and_validate source_bundle = InBundle(source_dir_or_zip, bundle_temp_dir)
    File "/app/BO/Bundle.py", line 54, in __init__ one_more()
    File "/app/BO/Bundle.py", line 49, in one_more raise ImportError("You tried to import too many files, max. is %d" % self.MAX_FILES)
    ImportError: You tried to import too many files, max. is 1000

This limitation seems somewhat arbitrary, and I think EcoTaxa should be able to read the data it has itself emitted.
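
To check up front whether an export archive would run into this limit, something like the following could be used. This is only a sketch: the value 1000 is taken from the error message above, the helper names `count_members` and `exceeds_import_limit` are made up for illustration, and I am assuming the server counts every file in the archive (it might count only TSV files, or only images).

```python
import zipfile

# Taken from the error message; assumed to match the server-side MAX_FILES.
MAX_FILES = 1000


def count_members(zip_path, suffix=None):
    """Count non-directory entries in a zip archive.

    If suffix is given (e.g. ".tsv"), only entries whose name ends
    with it (case-insensitive) are counted.
    """
    with zipfile.ZipFile(zip_path) as archive:
        names = [info.filename for info in archive.infolist() if not info.is_dir()]
    if suffix is not None:
        names = [name for name in names if name.lower().endswith(suffix)]
    return len(names)


def exceeds_import_limit(zip_path, limit=MAX_FILES):
    """Return True if the archive holds more files than the assumed limit."""
    return count_members(zip_path) > limit
```

For the project above, `count_members(path, ".tsv")` alone would presumably return 4060, so the archive exceeds the limit regardless of how exactly the files are counted.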
