commit | a211ee7d7cf25e1fc13b65dd020a29d5a8714479 | [log] [tgz] |
---|---|---|
author | Scott Lobdell <slobdell@google.com> | Thu Oct 15 11:45:37 2020 -0700 |
committer | Scott Lobdell <slobdell@google.com> | Thu Oct 15 11:45:37 2020 -0700 |
tree | 9927a95f1dcd532828581146a37b1ddfa539a15f | |
parent | dea77846c5b8308b35bfea373db1ce80ef0892e4 [diff] | |
parent | fdc7f94d63dffdbf0a3a505b9fb2e294cf7f51e4 [diff] |
Merge SP1A.201015.001 Change-Id: I08b4725f88aa92eb7fe952e75e80cf2a9e2fb8ba
This library exists to provide case conversion between common cases like CamelCase and snake_case. It is intended to be unicode aware, internally consistent, and reasonably well performing.
Word boundaries are defined as the "unicode words" defined in the unicode_segmentation
library, as well as within those words in this manner:
That is, "HelloWorld" is segmented Hello|World
whereas "XMLHttpRequest" is segmented XML|Http|Request
.
Characters not within words (such as spaces, punctuations, and underscores) are not included in the output string except as they are a part of the case being converted to. Multiple adjacent word boundaries (such as a series of underscores) are folded into one. ("hello__world" in snake case is therefore "hello_world", not the exact same string). Leading or trailing word boundary indicators are dropped, except insofar as CamelCase capitalizes the first word.
PRs of additional well-established cases welcome.
This library is a little bit opinionated (dropping punctuation, for example). If that doesn't fit your use case, I hope there is another crate that does. I would prefer not to receive PRs to make this behavior more configurable.
Bug reports & fixes always welcome. :-)
heck is distributed under the terms of both the MIT license and the Apache License (Version 2.0).
See LICENSE-APACHE and LICENSE-MIT for details.