README
ilib-loctool-json
Ilib loctool plugin to parse and localize json files.
This plugin can parse and localize json files by either understanding the schema via a jsonschema v7 file, or by defaulting to a list of key/value pairs.
Style of JSON
By default, this plugin assumes that the json file conforms to the Json5 spec. Earlier forms of json are subsets of Json5, so Json5 is the most flexible standard to process.
This means you can put comments in the json file itself or have trailing commas, etc. Some of these comments are processed along with string values to add optional translator comments to the strings. The localized output files will conform to older versions of json
Methods of Localizing
Json files are localized in one of the following methods:
- copy. Make a copy of the entire source file and replace values of certain properties in the copy with translations. The copy can be sent to any output desired directory using a template. Copy is the default method with which localized json files are handled.
- sparse. Make a copy of the source file where only the localized properties appear. The copy has the same structure as the original json file, but only properties where the value is localized appear in the output.
The first method, localizing the entire file, has the advantage that you don't need to change your code in order to read the translated file. You just need to pick the right file for the current locale.
The second method is similar to the first method above, except that it can save space because all of the non-localizable data in the original json file is not duplicated. In this case, you would need to load in the English file first, then mix-in the localized file to override all the localizable strings in order to create the full data with localized data embedded in it.
Key/Value Pairs
In the absence of any schema information, a default schema will be applied. The plugin will assume that the json file is a simple object where the property names are the keys and the property values are the strings to translate.
Example:
{
"unique id 1": "first string to translate",
"unique id 2": "second string to translate",
etc.
}
Essentially, this means that we assume that the file has the following json schema:
{
"$schema": "http://json-schema.org/draft-07/schema#",
"$id": "strings-schema",
"type": "object",
"description": "A flat object of properties with localizable values",
"additionalProperties": {
"type": "string",
"localizable": true
}
}
Configuring the Plugin
The plugin will look for the json
property within the settings
of your project.json
file. The following settings are
used within the json property:
- schemas: a string naming a directory full of json schema files, or an array of strings naming some json schema files or directories to load. If the json file does not fit any of the schema (ie. it does not validate according to any one of the schema), then that file will be skipped and not localized.
- mappings: a mapping between file matchers and an object that gives
info used to localize the files that match it. This allows different
json files within the project to be processed with different schema.
The matchers are
a micromatch-style string,
similar to the the
includes
andexcludes
section of aproject.json
file. The value of that mapping is an object that can contain the following properties:- schema: schema to use with that matcher. The schema is
specified using the
$id
of one of the schemas loaded in theschemas
property above. The default schema is "strings-schema" which is given in the previous section. - method: one of "copy" or "sparse"
- copy: make a copy of the source file and localize the string contents. (This is the default method if not specified explicitly.)
- sparse: make a copy of the source file but only include localized strings
- template: a path template to use to generate the path to
the translated
output files. The template replaces strings in square brackets
with special values, and keeps any characters intact that are
not in square brackets. The default template, if not specified is
"resources/[localeDir]/[filename]". The plugin recognizes
and replaces the following strings in template strings:
- [dir] the original directory where the matched source file came from. This is given as a directory that is relative to the root of the project. eg. "foo/bar/strings.json" -> "foo/bar"
- [filename] the file name of the matching file. eg. "foo/bar/strings.json" -> "strings.json"
- [basename] the basename of the matching file without any extension eg. "foo/bar/strings.json" -> "strings"
- [extension] the extension part of the file name of the source file. etc. "foo/bar/strings.json" -> "json"
- [locale] the full BCP-47 locale specification for the target locale eg. "zh-Hans-CN" -> "zh-Hans-CN"
- [language] the language portion of the full locale eg. "zh-Hans-CN" -> "zh"
- [script] the script portion of the full locale eg. "zh-Hans-CN" -> "Hans"
- [region] the region portion of the full locale eg. "zh-Hans-CN" -> "CN"
- [localeDir] the full locale where each portion of the locale is a directory in this order: [langage], [script], [region]. eg, "zh-Hans-CN" -> "zh/Hans/CN", but "en" -> "en".
- [localeUnder] the full BCP-47 locale specification, but using underscores to separate the locale parts instead of dashes. eg. "zh-Hans-CN" -> "zh_Hans_CN"
- schema: schema to use with that matcher. The schema is
specified using the
Example configuration:
{
"settings": {
"json": {
"schemas": "./json/schemas",
"mappings": {
"**/appinfo.json": {
"schema": "http://www.lge.com/json/appinfo",
"method": "sparse",
"template": "resources/[localeDir]/appinfo.json"
},
"src/**/strings.json": {
"schema": "http://www.lge.com/json/strings",
"method": "copy",
"template": "[dir]/strings.[locale].json"
}
}
}
}
}
In the above example, any file named appinfo.json
will be parsed with the
http://www.lge.com/json/appinfo
schema. Because the method is sparse
,
only the parts of the json file that have translated strings in them will
appear in the output. The output file name is sent to the resources
directory.
In the second part of the example, any strings.json
file that appears in
the src
directory will be parsed with the schema http://www.lge.com/json/strings
and a full copy of the file, including the parts that were not localized,
will be sent to the same directory as the source file. However, the
localized file name will also contain the name of the locale to distinguish
it from the source file.
If the name of the localized file that the template produces is the same as the source file name, this plugin will throw an exception, the file will not be localized, and the loctool will continue on to the next file.
Extensions to JSON Schema
In regular JSON Schemas, there is no built-in way to indicate that any value is localizable. However, the JSON Schema spec allows for creating extensions to the keywords of your json schema and specifies that implementation must ignore any keywords that it does not recognize.
localizable
Keyword
The This plugin recognizes the new localizable
keyword as an extension. The
localizable
keyword specifies that the value of a property should be
localized by the methods provided above. By default, values are not
localizable unless explicitly specified using the localizable
keyword.
The localizable
keyword is ignored for null or undefined values. For
the primitive types string, integer, number, or boolean values, the value
is directly localizable. Each property will result in a translation unit
that the translators may localize.
When the localizable
keyword is given for arrays,
every item in the array is localizable and must be of a primitive type
(string, integer, number, or boolean). If any array entries are not of
a primitive type, an exception will be thrown and the localization will
be halted.
When the localizable
keyword
is given for an object type, that object encodes a plural string. The
subproperties allowed in that object are those defined in the Unicode
plural rules:
- zero
- one
- two
- few
- many
- other
The "one" and the "other" property are required for source files in English. Other languages will have different combinations of plural categories.
Example json schema file:
{
"$id": "",
"title": "an location data",
"properties": {
"name": {
"type": "string",
"description": "the name of this address, like 'work' or 'home'. This string is not localized",
"$comment": "note that this is not localizable"
},
"country": {
"type": "string",
"localizable": true,
"description": "the name of the country localized the the current language"
},
"latitude": {
"type": "number",
"localizable": true,
"description": "an example of a localizable number"
},
"places": {
"type": "array",
"localizable": true,
"items": {
"type": "string"
}
},
"plural": {
"type": "object",
"additionalProperties": {
"type": "object",
"localizable": true,
"required": [
"one",
"other"
],
"properties": {
"zero": {"type": "string"},
"one": {"type": "string"},
"two": {"type": "string"},
"few": {"type": "string"},
"many": {"type": "string"},
"other": {"type": "string"}
},
"additionalProperties": false
}
}
}
}
The localizable
keyword is also ignored for const
value types, as the
value is supposed to remain constant.
For strings that have an enum
keyword, each of the values in the enum
will
not be translated as well, as the code that reads this json file is explicitly
expecting one of the given fixed values.
Not a Validator
Please note that this plugin is not a json schema validator, though it works in similar ways. If the json being parsed does not conform to the given schema, no errors or exceptions will be thrown, but it will print some warnings to the output. The strings will merely not be extracted/localized as expected.
License
This plugin is license under Apache2. See the LICENSE file for more details.
Release Notes
v1.2.3
- fix issue of the parser that resulted in skipping objects with
single boolean field which equals
false
v1.2.2
- Fix a bug where the pseudo locales were not initialized properly. This fix gets the right set of locales from the project settings to see if any of them are pseudo locales.
v1.2.1
- Fixed bug where it was not generating pseudo localized text properly for missing translations when pseudo is turned on
v1.2.0
- add support for $refs in array schema definitions
v1.1.1
- few README.md files (this file!)
v1.1.0
- extended array support:
- corrected handling of primitive types. Values are casted to their initial type upon translation
- added support for array of objects
- fixed a bug of a copy method that results in json parts, that are not defined in the schema, to be removed from localized files
v1.0.1
- fixed a bug where plural strings resources were not extracted to the new strings file properly
v1.0.0
- initial version
- support json schema and also default schema (key/value pairs)