Pass NCBI-specific metadata (domain name, collection id, collection title, source type, publisher name, publisher location) to NCBI for XML conversions
Expected behaviour
XML requirements
See expected XML Conversion requirements here:
wholebook processed content
Coko will pass the following pieces of NCBI metadata to XML conversion and PDF tagging:
- domain name (required)
- collection id (optional)
- collection title - TBD whether this can be a string or requires formatting and needs to be passed during Load to PMC
- source type (required)
- publisher name (optional)
- publisher location (optional)
All other data or metadata is present in the source XML.
{
"job_id": 9876543210, // Reference ID for the XML conversion job. It's generated by Coko and used by NCBI to report back the status of the conversion.
"user_name": "jordandc", // Name of a user initiating a conversion
"main_xml": "978-3-030-47318-1_Book.xml" // Filename of the primary XML file to use as the conversion entry point.
"domain": "spr9783030473181", // PMCBook domain name of the book being converted
"collection_id": "wtcollect", // PMCBook domain name of the collection that the book being converted belongs to (OPTIONAL)
"collection_title": "Wellcome Monographs", // Title of the collection that the book being converted belongs to (OPTIONAL)
"source_type": "Book", // BCMS assigned "source-type" for the book being converted
"publisher_name": "Springer", // BCMS assigned name of the bibliographic publisher (OPTIONAL)
"publisher_loc": "Cham (CH)", // BCMS assigned publication place (OPTIONAL)
"package": "spr9783030473181.9876543210.2020_05_15-09_25_45.zip", // Name of the package with the XML file, in domain.job_id.timestamp format
"notification_recipients": { // List of emails for NCBI Task Manager (not CoKo) to send notifications to
"success": ["bookshelf@ncbi.nlm.nih.gov","fritz@publisher.org"],
"failure": ["bookshelf@ncbi.nlm.nih.gov"]
}
}
Current behaviour
Collection and publisher information is not getting passed to NCBI for XML conversions
XML
The collection id, collection title, publisher name, and publisher location are not getting passed for XML conversions that need them to meet Bookshelf and PubMed metadata requirements for display and indexing.
Steps to reproduce
- Create an XML wholebook
- Enter in source type and publisher name and publisher location using metadata UI
- Add a collection via settings
- Check that all above values are sent to NCBI for XML conversion in JSON per provided specifications
- Check converted XML that all these values are correctly tagged per those provided specifications
Environment
Possible solution
QA Steps
[To be completed by Coko once dev is done]
Scheduling
Fixing this issue is required for Priority 1: Deploy MVP