Ingram, an e-book distributor, specializes in print and online product distribution across the globe. They offer customized services to meet publishers' needs and handle the entire distribution process, including sales and marketing. Ingram approached Newgen to solve their unique problem of validating the e-files they receive daily from different publishers and self-publishing authors.
Ingram's biggest concern was e-file validation before distribution. They receive about 20k titles every week from different publishers and authors, and at least 10–15% of the incoming ePub files had validation issues that prevented the distributor from publishing them directly. Ingram therefore started looking for a partner who could offer cost-effective, high-quality services and fix the validation issues to their standards in a short span of time (about a day or two).
Some of the key validation issues faced were:
- Inconsistency in the entry between OPF and NCX
- Failure to unzip the ePub files
- Corrupted ePub package
- Unwanted <span> tags and Unicode entity values
- Image resolution issues
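To make the first three issues concrete, here is a minimal Python sketch of how such checks might look. It is not the portal's implementation. The function name `check_epub` and the message strings are hypothetical, and "inconsistency between OPF and NCX" is interpreted here, as an assumption, to mean an NCX entry that points to a file missing from the OPF manifest.

```python
import io
import posixpath
import zipfile
import xml.etree.ElementTree as ET

NS = {
    "c": "urn:oasis:names:tc:opendocument:xmlns:container",
    "opf": "http://www.idpf.org/2007/opf",
    "ncx": "http://www.daisy.org/z3986/2005/ncx/",
}


def _resolve(base_file, href):
    """Resolve href (fragment removed) relative to the folder of base_file."""
    path = href.split("#", 1)[0]
    return posixpath.normpath(posixpath.join(posixpath.dirname(base_file), path))


def check_epub(data):
    """Return a list of problems found in an ePub given as bytes (hypothetical helper)."""
    try:
        zf = zipfile.ZipFile(io.BytesIO(data))
    except zipfile.BadZipFile:
        return ["cannot unzip: not a valid ZIP archive"]
    problems = []
    with zf:
        # testzip() re-reads every entry and returns the first one with a bad CRC.
        bad_entry = zf.testzip()
        if bad_entry is not None:
            return ["corrupted entry: " + bad_entry]
        names = set(zf.namelist())
        if "META-INF/container.xml" not in names:
            return ["missing META-INF/container.xml"]
        try:
            container = ET.fromstring(zf.read("META-INF/container.xml"))
            rootfile = container.find(".//c:rootfile", NS)
            if rootfile is None or rootfile.get("full-path") not in names:
                return ["container.xml does not point to an OPF file in the archive"]
            opf_path = rootfile.get("full-path")
            opf = ET.fromstring(zf.read(opf_path))
            # Map each manifest file (path inside the archive) to its media type.
            manifest = {}
            for item in opf.findall(".//opf:manifest/opf:item", NS):
                manifest[_resolve(opf_path, item.get("href", ""))] = item.get("media-type")
            for ncx_path, media_type in manifest.items():
                if media_type != "application/x-dtbncx+xml":
                    continue
                if ncx_path not in names:
                    problems.append("NCX listed in OPF but missing: " + ncx_path)
                    continue
                ncx = ET.fromstring(zf.read(ncx_path))
                for content in ncx.findall(".//ncx:navPoint/ncx:content", NS):
                    target = _resolve(ncx_path, content.get("src", ""))
                    if target not in manifest:
                        problems.append("NCX entry not in OPF manifest: " + target)
        except ET.ParseError as exc:
            problems.append("malformed XML: " + str(exc))
    return problems
```

An empty list means none of these particular checks failed. It does not mean the file would pass a full validator.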
The Solution Delivered
Newgen developed a web portal that would enable Ingram to deliver 20k titles every day. The automation minimized human intervention, thereby reducing human error. This API-based web portal, powered by Slim Framework, also fixes tagging issues automatically and creates the ePub as per the latest IDPF standard. The backend validation and fixing process of the portal was built in Node.js, with a database powered by MySQL.
Here are the key features of the ePub fix tool:
Automated validation against the EPUBCheck version.
Autofixing of issues based on API coding within the system.
- Quick Access
Extract the text content of the original book from the ePub file.
Easily calculate the character count, elements and images.
- Deliver Error-Free Files
Validate the links thoroughly and fix issues, if any.
Check for the IDPF family of errors and correct the issues.
NAV and NCX checking and validation.
- Compliance with Required Standards
Ensure that fonts and attributes align with the specified guidelines.
Check that Unicode text is encoded in UTF-8.
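As an illustration of the counting and encoding features listed above, the following Python sketch checks that a single XHTML content document decodes as UTF-8 and then counts its text characters, elements and images. The function name `content_stats` and the returned keys are hypothetical, and this is only one possible approach, not a description of how the portal works internally.

```python
import xml.etree.ElementTree as ET


def content_stats(raw):
    """Check UTF-8 validity of one XHTML document and count its contents."""
    try:
        raw.decode("utf-8")
    except UnicodeDecodeError as exc:
        return {"utf8": False, "error": "invalid UTF-8 at byte %d" % exc.start}

    root = ET.fromstring(raw)
    # Element tags look like "{namespace}img"; keep only the local name.
    local_names = [el.tag.rsplit("}", 1)[-1] for el in root.iter()]
    return {
        "utf8": True,
        # Text characters, including whitespace between elements.
        "characters": len("".join(root.itertext())),
        "elements": len(local_names),
        "images": local_names.count("img"),
    }
```

Note that the standard-library parser used here rejects named HTML entities such as `&nbsp;` unless they are declared, so a production tool would likely need a more tolerant parser.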
The Impact Created
Through the web portal, validation was done with minimal human effort and in less time. For instance, validating 2,500 titles took only 2 hours, which was 30 times faster than manual effort. Ingram was impressed to see the results delivered within such a short period of time.