Review workflow diagram: single doc XML book via FTP pathway
I've created a diagram to show the flow of a single document XML book via FTP pathway.
Note that a "single document XML book" means that a book's text is provided in one XML file, but there are other files included in the FTP upload, as in the example screenshot:
Example FTP files
I've added this example book to NCBI/client-shared/sample-content/ukhsdr0807_2020-02-06_08-37-23.tar.gz
Here's the workflow diagram:
Queries to resolve
I've noted queries in orange boxes. Some of these came up during our recent design session at NCBI; others are new queries I had while drawing the workflow. We need to resolve these with NBCI, but first, please:
- prioritise the queries in order or urgency -- what do you need to know asap to continue current work versus what can wait and for how long.
- provide your feedback on workflow and the queries
- let me know what other queries you have on this workflow
Notes on queries
Query 2.1: What kind of unique identifier should be used?
The example file has this book ID information:
<book-id book-id-type="pmcid">ukhsdr0807</book-id> <!--publisher-id: 12/209/27--> <book-id book-id-type="doi">10.3310/hsdr08070</book-id> <book-title-group>
We should not assume that this will be the ID going forward. The files provided here are for NCBI's current FTP workflow (according to these general XML file submission specifications) which may differ from the specifications going forward.