blob: e7a51abc6f0a5309478b780c8a710b1c101d2453 [file] [log] [blame] [view]
David Tolnay6eacaff2016-10-23 15:20:23 -07001Nom parser for Rust source code
2===============================
David Tolnay35161ff2016-09-03 11:33:15 -07003
David Tolnayac9953b2016-09-07 08:37:12 -07004[![Build Status](https://api.travis-ci.org/dtolnay/syn.svg?branch=master)](https://travis-ci.org/dtolnay/syn)
5[![Latest Version](https://img.shields.io/crates/v/syn.svg)](https://crates.io/crates/syn)
David Tolnay87003d02018-05-20 19:45:13 -07006[![Rust Documentation](https://img.shields.io/badge/api-rustdoc-blue.svg)](https://docs.rs/syn/0.14/syn/)
David Tolnay8659a232018-03-31 22:54:03 +02007[![Rustc Version 1.15+](https://img.shields.io/badge/rustc-1.15+-lightgray.svg)](https://blog.rust-lang.org/2017/02/02/Rust-1.15.html)
David Tolnayac9953b2016-09-07 08:37:12 -07008
David Tolnayc088adb2018-01-01 00:26:05 -05009Syn is a parsing library for parsing a stream of Rust tokens into a syntax tree
10of Rust source code.
David Tolnay35161ff2016-09-03 11:33:15 -070011
David Tolnayc088adb2018-01-01 00:26:05 -050012Currently this library is geared toward the [custom derive] use case but
13contains some APIs that may be useful for Rust procedural macros more generally.
David Tolnayf939f352016-09-11 18:00:09 -070014
David Tolnayc088adb2018-01-01 00:26:05 -050015[custom derive]: https://github.com/rust-lang/rfcs/blob/master/text/1681-macros-1.1.md
David Tolnayf939f352016-09-11 18:00:09 -070016
David Tolnayc088adb2018-01-01 00:26:05 -050017- **Data structures** — Syn provides a complete syntax tree that can represent
18 any valid Rust source code. The syntax tree is rooted at [`syn::File`] which
19 represents a full source file, but there are other entry points that may be
20 useful to procedural macros including [`syn::Item`], [`syn::Expr`] and
21 [`syn::Type`].
David Tolnayfb9f7042016-12-22 12:31:39 -050022
David Tolnayc088adb2018-01-01 00:26:05 -050023- **Custom derives** — Of particular interest to custom derives is
24 [`syn::DeriveInput`] which is any of the three legal input items to a derive
25 macro. An example below shows using this type in a library that can derive
26 implementations of a trait of your own.
27
28- **Parser combinators** — Parsing in Syn is built on a suite of public parser
29 combinator macros that you can use for parsing any token-based syntax you
30 dream up within a `functionlike!(...)` procedural macro. Every syntax tree
31 node defined by Syn is individually parsable and may be used as a building
32 block for custom syntaxes, or you may do it all yourself working from the most
33 primitive tokens.
34
35- **Location information** — Every token parsed by Syn is associated with a
36 `Span` that tracks line and column information back to the source of that
37 token. These spans allow a procedural macro to display detailed error messages
38 pointing to all the right places in the user's code. There is an example of
39 this below.
40
41- **Feature flags** — Functionality is aggressively feature gated so your
42 procedural macros enable only what they need, and do not pay in compile time
43 for all the rest.
44
David Tolnay87003d02018-05-20 19:45:13 -070045[`syn::File`]: https://docs.rs/syn/0.14/syn/struct.File.html
46[`syn::Item`]: https://docs.rs/syn/0.14/syn/enum.Item.html
47[`syn::Expr`]: https://docs.rs/syn/0.14/syn/enum.Expr.html
48[`syn::Type`]: https://docs.rs/syn/0.14/syn/enum.Type.html
49[`syn::DeriveInput`]: https://docs.rs/syn/0.14/syn/struct.DeriveInput.html
David Tolnayc088adb2018-01-01 00:26:05 -050050
51If you get stuck with anything involving procedural macros in Rust I am happy to
52provide help even if the issue is not related to Syn. Please file a ticket in
53this repo.
54
55*Version requirement: Syn supports any compiler version back to Rust's very
56first support for procedural macros in Rust 1.15.0. Some features especially
57around error reporting are only available in newer compilers or on the nightly
58channel.*
59
David Tolnay86031562018-05-20 22:39:09 -070060[*Release notes*](https://github.com/dtolnay/syn/releases)
61
David Tolnayc088adb2018-01-01 00:26:05 -050062## Example of a custom derive
63
64The canonical custom derive using Syn looks like this. We write an ordinary Rust
65function tagged with a `proc_macro_derive` attribute and the name of the trait
66we are deriving. Any time that derive appears in the user's code, the Rust
67compiler passes their data structure as tokens into our macro. We get to execute
68arbitrary Rust code to figure out what to do with those tokens, then hand some
69tokens back to the compiler to compile into the user's crate.
70
71[`TokenStream`]: https://doc.rust-lang.org/proc_macro/struct.TokenStream.html
David Tolnay6c9f5b62016-09-13 15:19:22 -070072
David Tolnay69b538e2016-09-23 19:59:48 -070073```toml
74[dependencies]
David Tolnay87003d02018-05-20 19:45:13 -070075syn = "0.14"
76quote = "0.6"
David Tolnay69b538e2016-09-23 19:59:48 -070077
78[lib]
David Tolnay45cc8492016-10-08 20:52:03 -070079proc-macro = true
David Tolnay69b538e2016-09-23 19:59:48 -070080```
81
White-Oak82d0db72016-09-13 21:45:58 +030082```rust
David Tolnay45cc8492016-10-08 20:52:03 -070083extern crate proc_macro;
David Tolnayb4c63262016-09-23 20:03:06 -070084extern crate syn;
David Tolnay6c9f5b62016-09-13 15:19:22 -070085
White-Oak82d0db72016-09-13 21:45:58 +030086#[macro_use]
87extern crate quote;
White-Oak82d0db72016-09-13 21:45:58 +030088
David Tolnayc088adb2018-01-01 00:26:05 -050089use proc_macro::TokenStream;
90use syn::DeriveInput;
91
David Tolnay45cc8492016-10-08 20:52:03 -070092#[proc_macro_derive(MyMacro)]
David Tolnayb4c63262016-09-23 20:03:06 -070093pub fn my_macro(input: TokenStream) -> TokenStream {
David Tolnayc088adb2018-01-01 00:26:05 -050094 // Parse the input tokens into a syntax tree
95 let input: DeriveInput = syn::parse(input).unwrap();
White-Oak82d0db72016-09-13 21:45:58 +030096
David Tolnay6c9f5b62016-09-13 15:19:22 -070097 // Build the output, possibly using quasi-quotation
98 let expanded = quote! {
99 // ...
100 };
101
David Tolnayc088adb2018-01-01 00:26:05 -0500102 // Hand the output tokens back to the compiler
103 expanded.into()
White-Oak82d0db72016-09-13 21:45:58 +0300104}
105```
David Tolnay6c9f5b62016-09-13 15:19:22 -0700106
David Tolnayc088adb2018-01-01 00:26:05 -0500107The [`heapsize`] example directory shows a complete working Macros 1.1
108implementation of a custom derive. It works on any Rust compiler \>=1.15.0. The
109example derives a `HeapSize` trait which computes an estimate of the amount of
110heap memory owned by a value.
David Tolnayb988b6d2016-10-05 00:12:37 -0700111
David Tolnayc088adb2018-01-01 00:26:05 -0500112[`heapsize`]: examples/heapsize
David Tolnayb988b6d2016-10-05 00:12:37 -0700113
114```rust
David Tolnayc088adb2018-01-01 00:26:05 -0500115pub trait HeapSize {
116 /// Total number of bytes of heap memory owned by `self`.
117 fn heap_size_of_children(&self) -> usize;
David Tolnayb988b6d2016-10-05 00:12:37 -0700118}
119```
120
David Tolnayc088adb2018-01-01 00:26:05 -0500121The custom derive allows users to write `#[derive(HeapSize)]` on data structures
122in their program.
David Tolnayb988b6d2016-10-05 00:12:37 -0700123
124```rust
David Tolnayc088adb2018-01-01 00:26:05 -0500125#[derive(HeapSize)]
126struct Demo<'a, T: ?Sized> {
127 a: Box<T>,
128 b: u8,
129 c: &'a str,
130 d: String,
David Tolnayb988b6d2016-10-05 00:12:37 -0700131}
132```
133
David Tolnayc088adb2018-01-01 00:26:05 -0500134## Spans and error reporting
David Tolnayc2263f32017-03-09 19:20:52 -0800135
David Tolnayc088adb2018-01-01 00:26:05 -0500136The [`heapsize2`] example directory is an extension of the `heapsize` example
137that demonstrates some of the hygiene and error reporting properties of Macros
1382.0. This example currently requires a nightly Rust compiler \>=1.24.0-nightly
139but we are working to stabilize all of the APIs involved.
David Tolnayc2263f32017-03-09 19:20:52 -0800140
David Tolnayc088adb2018-01-01 00:26:05 -0500141[`heapsize2`]: examples/heapsize2
David Tolnay736829a2016-12-22 15:55:53 -0500142
David Tolnayc088adb2018-01-01 00:26:05 -0500143The token-based procedural macro API provides great control over where the
144compiler's error messages are displayed in user code. Consider the error the
145user sees if one of their field types does not implement `HeapSize`.
David Tolnay736829a2016-12-22 15:55:53 -0500146
147```rust
David Tolnayc088adb2018-01-01 00:26:05 -0500148#[derive(HeapSize)]
149struct Broken {
150 ok: String,
151 bad: std::thread::Thread,
David Tolnay736829a2016-12-22 15:55:53 -0500152}
153```
154
David Tolnayc088adb2018-01-01 00:26:05 -0500155In the Macros 1.1 string-based procedural macro world, the resulting error would
156point unhelpfully to the invocation of the derive macro and not to the actual
157problematic field.
158
159```
160error[E0599]: no method named `heap_size_of_children` found for type `std::thread::Thread` in the current scope
161 --> src/main.rs:4:10
162 |
1634 | #[derive(HeapSize)]
164 | ^^^^^^^^
165```
166
167By tracking span information all the way through the expansion of a procedural
168macro as shown in the `heapsize2` example, token-based macros in Syn are able to
169trigger errors that directly pinpoint the source of the problem.
170
171```
172error[E0277]: the trait bound `std::thread::Thread: HeapSize` is not satisfied
173 --> src/main.rs:7:5
174 |
1757 | bad: std::thread::Thread,
176 | ^^^^^^^^^^^^^^^^^^^^^^^^ the trait `HeapSize` is not implemented for `std::thread::Thread`
177```
178
179## Parsing a custom syntax using combinators
180
181The [`lazy-static`] example directory shows the implementation of a
182`functionlike!(...)` procedural macro in which the input tokens are parsed using
183[`nom`]-style parser combinators.
184
185[`lazy-static`]: examples/lazy-static
186[`nom`]: https://github.com/Geal/nom
187
188The example reimplements the popular `lazy_static` crate from crates.io as a
189procedural macro.
190
191```
192lazy_static! {
193 static ref USERNAME: Regex = Regex::new("^[a-z0-9_-]{3,16}$").unwrap();
194}
195```
196
197The implementation shows how to trigger custom warnings and error messages on
198the macro input.
199
200```
201warning: come on, pick a more creative name
202 --> src/main.rs:10:16
203 |
20410 | static ref FOO: String = "lazy_static".to_owned();
205 | ^^^
206```
207
David Tolnay941c0922016-12-22 16:06:27 -0500208## Debugging
209
210When developing a procedural macro it can be helpful to look at what the
211generated code looks like. Use `cargo rustc -- -Zunstable-options
David Tolnayc088adb2018-01-01 00:26:05 -0500212--pretty=expanded` or the [`cargo expand`] subcommand.
213
214[`cargo expand`]: https://github.com/dtolnay/cargo-expand
David Tolnay941c0922016-12-22 16:06:27 -0500215
216To show the expanded code for some crate that uses your procedural macro, run
217`cargo expand` from that crate. To show the expanded code for one of your own
218test cases, run `cargo expand --test the_test_case` where the last argument is
219the name of the test file without the `.rs` extension.
220
David Tolnay3bfbd542017-01-16 14:57:53 -0800221This write-up by Brandon W Maister discusses debugging in more detail:
David Tolnayc088adb2018-01-01 00:26:05 -0500222[Debugging Rust's new Custom Derive system][debugging].
223
224[debugging]: https://quodlibetor.github.io/posts/debugging-rusts-new-custom-derive-system/
David Tolnay3bfbd542017-01-16 14:57:53 -0800225
David Tolnay686f5042016-10-30 19:24:51 -0700226## Optional features
227
228Syn puts a lot of functionality behind optional features in order to optimize
David Tolnayc088adb2018-01-01 00:26:05 -0500229compile time for the most common use cases. The following features are
230available.
David Tolnay686f5042016-10-30 19:24:51 -0700231
David Tolnayc088adb2018-01-01 00:26:05 -0500232- **`derive`** *(enabled by default)* — Data structures for representing the
233 possible input to a custom derive, including structs and enums and types.
234- **`full`** — Data structures for representing the syntax tree of all valid
235 Rust source code, including items and expressions.
236- **`parsing`** *(enabled by default)* — Ability to parse input tokens into a
237 syntax tree node of a chosen type.
238- **`printing`** *(enabled by default)* — Ability to print a syntax tree node as
239 tokens of Rust source code.
240- **`visit`** — Trait for traversing a syntax tree.
241- **`visit-mut`** — Trait for traversing and mutating in place a syntax tree.
242- **`fold`** — Trait for transforming an owned syntax tree.
243- **`clone-impls`** *(enabled by default)* — Clone impls for all syntax tree
244 types.
245- **`extra-traits`** — Debug, Eq, PartialEq, Hash impls for all syntax tree
246 types.
David Tolnayd236b552018-04-08 09:02:55 -0700247- **`proc-macro`** *(enabled by default)* — Runtime dependency on the dynamic
248 library libproc_macro from rustc toolchain.
David Tolnayc088adb2018-01-01 00:26:05 -0500249
250## Nightly features
251
252By default Syn uses the [`proc-macro2`] crate to emulate the nightly compiler's
253procedural macro API in a stable way that works all the way back to Rust 1.15.0.
254This shim makes it possible to write code without regard for whether the current
255compiler version supports the features we use.
256
257[`proc-macro2`]: https://github.com/alexcrichton/proc-macro2
258
259On a nightly compiler, to eliminate the stable shim and use the compiler's
260`proc-macro` directly, add `proc-macro2` to your Cargo.toml and set its
261`"nightly"` feature which bypasses the stable shim.
262
263```toml
264[dependencies]
David Tolnay87003d02018-05-20 19:45:13 -0700265syn = "0.14"
266proc-macro2 = { version = "0.4", features = ["nightly"] }
David Tolnayc088adb2018-01-01 00:26:05 -0500267```
David Tolnayed7a5082016-10-30 20:06:29 -0700268
David Tolnay35161ff2016-09-03 11:33:15 -0700269## License
270
271Licensed under either of
272
273 * Apache License, Version 2.0 ([LICENSE-APACHE](LICENSE-APACHE) or http://www.apache.org/licenses/LICENSE-2.0)
274 * MIT license ([LICENSE-MIT](LICENSE-MIT) or http://opensource.org/licenses/MIT)
275
276at your option.
277
278### Contribution
279
280Unless you explicitly state otherwise, any contribution intentionally submitted
281for inclusion in this crate by you, as defined in the Apache-2.0 license, shall
282be dual licensed as above, without any additional terms or conditions.