Conversational Speech dataset

Publicly available dataset of conversational speech audio recordings.
This CL includes the following:
- README.md: dataset description file, it also includes the scripts
- *.wav.sha1: hash files for each audio track in the dataset

The overall size of the wav files is ~36MB.
The primary intended use of this dataset is in combination with the conversational speech tool (see https://chromium.googlesource.com/external/webrtc/+/master/webrtc/modules/audio_processing/test/conversational_speech/), using which longer recordings with custom turn switch timing can be created.

BUG=webrtc:7218

Review-Url: https://codereview.webrtc.org/2869833002
Cr-Commit-Position: refs/heads/master@{#18068}
67 files changed
tree: e2848f33ca6053caf403a3b83fe82eb6269dca58
  1. build_overrides/
  2. data/
  3. infra/
  4. resources/
  5. tools_webrtc/
  6. webrtc/
  7. .clang-format
  8. .git-blame-ignore-revs
  9. .gitignore
  10. .gn
  11. AUTHORS
  12. BUILD.gn
  13. check_root_dir.py
  14. cleanup_links.py
  15. codereview.settings
  16. DEPS
  17. LICENSE
  18. license_template.txt
  19. LICENSE_THIRD_PARTY
  20. OWNERS
  21. PATENTS
  22. PRESUBMIT.py
  23. pylintrc
  24. README.md
  25. WATCHLISTS
README.md

WebRTC is a free, open software project that provides browsers and mobile applications with Real-Time Communications (RTC) capabilities via simple APIs. The WebRTC components have been optimized to best serve this purpose.

Our mission: To enable rich, high-quality RTC applications to be developed for the browser, mobile platforms, and IoT devices, and allow them all to communicate via a common set of protocols.

The WebRTC initiative is a project supported by Google, Mozilla and Opera, amongst others.

Development

See http://www.webrtc.org/native-code/development for instructions on how to get started developing with the native code.

More info