Question 1 of 30
A digital archive is tasked with preserving a collection of historical documents from Hong Kong dating back to the early 20th century. The documents are primarily written in Chinese, but closer inspection reveals that they contain a mix of both Cantonese and Mandarin, with Cantonese being the predominant language used in daily communication and legal proceedings at the time. The archive aims to adhere to ISO 20614:2017 standards for interoperability and preservation, ensuring that the language information is accurately captured for future researchers and automated processing systems. Given the linguistic context and the requirements for precise language identification, which of the following approaches to language coding would be the MOST appropriate to apply to these documents according to ISO 639 standards, considering the need to differentiate between the two Chinese language varieties for long-term accessibility and understanding? The archive\'s policy emphasizes retaining the nuances of the original documents, including dialectal variations.
Applying ISO 639 language subtags (as defined by IETF BCP 47) to differentiate between Mandarin and Cantonese (e.g., 'zh-cmn' for Mandarin, 'zh-yue' for Cantonese) in conjunction with the base language code.
Utilizing the ISO 639-1 code 'zh' or the ISO 639-2 code 'chi' to represent the language as simply "Chinese," without specifying the dialectal variations.
Creating custom, non-standard language codes specific to the archive, documenting them internally for future reference, as ISO 639 does not explicitly differentiate between Cantonese and Mandarin.
Using the ISO 639-3 codes only for the most prominent words in each document, and then creating a separate metadata field describing the presence of both languages without assigning specific codes to each section.

Preparing for ISO 20614:2017 Information and documentation -- Data exchange protocol for interoperability and preservation? Now land the interview.

73% of qualified candidates get rejected because of weak resumes. Build an ATS-optimized, recruiter-ready resume in under 5 minutes - free to start.

Build My Resume Free