Specify allowed source and converted file extensions
Context
NCBI supports three content conversion workflows. The file extensions allowed for each workflow vary somewhat. This proposal updates the existing rules. It is documented as user story 11 in the BCMS scoping outcomes list.
Proposal
Support file uploads with the following extension rules:

Workflow | Source file extension | Converted file extension |
---|---|---|
Word WF | .doc ; .docx | .xml ; .bxml |
PDF WF | .doc ; .docx ; .pdf ; .xlsx ; .xml | .xml ; .bxml |
XML | .xml ; .doc ; .docx ; .pdf ; .xlsx | .xml ; .bxml |
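
The rules in the table could be captured as a lookup plus a small check, roughly as sketched below. This is only an illustration, not the BCMS implementation: the workflow keys (`word`, `pdf`, `xml`), the `source`/`converted` labels, the function name `is_allowed`, and the case-insensitive comparison are all assumptions made for the example.

```python
from pathlib import PurePath

# Hypothetical keys; the extension sets mirror the table in this proposal.
ALLOWED_EXTENSIONS = {
    "word": {
        "source": {".doc", ".docx"},
        "converted": {".xml", ".bxml"},
    },
    "pdf": {
        "source": {".doc", ".docx", ".pdf", ".xlsx", ".xml"},
        "converted": {".xml", ".bxml"},
    },
    "xml": {
        "source": {".xml", ".doc", ".docx", ".pdf", ".xlsx"},
        "converted": {".xml", ".bxml"},
    },
}


def is_allowed(workflow: str, kind: str, filename: str) -> bool:
    """Return True if the file's final extension is allowed for this workflow and kind."""
    # Assumption: extensions are compared case-insensitively.
    extension = PurePath(filename).suffix.lower()
    return extension in ALLOWED_EXTENSIONS[workflow][kind]
```

For instance, `is_allowed("word", "source", "chapter1.pdf")` would be rejected under these rules, while `is_allowed("pdf", "source", "chapter1.pdf")` would be accepted.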
For chapter-processed books, the current rule for filename matching does not change. We still expect that file version 1 (filename.doc) may have a different extension than file version 2 (filename.docx).
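
As an illustration of the matching rule, two uploads could be treated as versions of the same file when their names are equal once the final extension is dropped. The helper name `same_file` is hypothetical, and comparing on the name without its last extension is an assumption about how the existing rule works.

```python
from pathlib import PurePath


def same_file(name_a: str, name_b: str) -> bool:
    """Return True if both names match once the final extension is dropped."""
    # Assumption: only the last extension is ignored when matching versions.
    return PurePath(name_a).stem == PurePath(name_b).stem
```

Under this sketch, `same_file("filename.doc", "filename.docx")` is true, so the second upload would count as a new version of the first.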
Dev tasks
- update the supported extensions per workflow
- update the UI validations and error messages
Related notes from scoping session (Thursday 19 May)
The tagging structure of a .bxml file will be the same as the specifications we already support for .xml; therefore, no updates are required for parsing the metadata.
QA Steps
- For chapter-processed books on all workflows, bulk upload files with the extensions listed above, as well as files with other extensions. For extensions that are not accepted, you will see a validation message.
- Source files from the files tab: on all workflows, when you upload a second file with the same filename and a different extension, it replaces the first file if the status is new upload; for any other status, it creates a new version of the file. (This needs further work in #1278 (closed), which is still in progress, so skip this QA step for this issue.)
- Source files for wholebooks on the PDF and XML workflows: bulk upload more than one file, each with a different extension from the allowed list.
- Converted files can be either .bxml or .xml, as described in issue #1176 (closed).
- For Word one doc files, the rules are as they were before (#215 (closed)), except that toc.bxml can now be uploaded as well.
- For all workflows, if the source or converted file has more than one dot in its filename, it can still be uploaded, for example filetest.bm.xml.
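
The last QA step implies that only the final extension matters for validation. A minimal sketch of that behaviour, assuming the extension is taken from the last dot onward and lowercased (the helper name `final_extension` is hypothetical):

```python
from pathlib import PurePath


def final_extension(filename: str) -> str:
    """Return the last extension of a filename, lowercased, including the dot."""
    # PurePath.suffix looks only at the last dot, so earlier dots are ignored.
    return PurePath(filename).suffix.lower()
```

With this approach, `final_extension("filetest.bm.xml")` gives `.xml`, so the extra `.bm` segment does not affect whether the upload is accepted.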