commit | 75c511f725957ff727ef7937300cbde75d016cfa | [log] [tgz] |
---|---|---|
author | Eric Arseneau <earseneau@google.com> | Mon Dec 06 14:36:37 2021 -0800 |
committer | Eric Arseneau <earseneau@google.com> | Mon Dec 06 14:36:37 2021 -0800 |
tree | f18e9d550e3389a6daa24962b2b82be5ddc7081a | |
parent | 62c01e8c4b082614f781a8af490a09f3eebc97ab [diff] | |
parent | 54d087b71aab080b471071b9fbfcd6c03cf3bf6b [diff] |
Merge mpr-2021-11-05 Change-Id: Ib7193490d1f9461cc39de8dc13703aaaedc45ca8
This library exists to provide case conversion between common cases like CamelCase and snake_case. It is intended to be unicode aware, internally consistent, and reasonably well performing.
Word boundaries are defined as the "unicode words" defined in the unicode_segmentation
library, as well as within those words in this manner:
That is, "HelloWorld" is segmented Hello|World
whereas "XMLHttpRequest" is segmented XML|Http|Request
.
Characters not within words (such as spaces, punctuations, and underscores) are not included in the output string except as they are a part of the case being converted to. Multiple adjacent word boundaries (such as a series of underscores) are folded into one. ("hello__world" in snake case is therefore "hello_world", not the exact same string). Leading or trailing word boundary indicators are dropped, except insofar as CamelCase capitalizes the first word.
PRs of additional well-established cases welcome.
This library is a little bit opinionated (dropping punctuation, for example). If that doesn't fit your use case, I hope there is another crate that does. I would prefer not to receive PRs to make this behavior more configurable.
Bug reports & fixes always welcome. :-)
heck is distributed under the terms of both the MIT license and the Apache License (Version 2.0).
See LICENSE-APACHE and LICENSE-MIT for details.