blob: a6c718bf25208a5c4020aeb26c6a3f0a9c9aec75 [file] [log] [blame] [view]
David Tolnay6eacaff2016-10-23 15:20:23 -07001Nom parser for Rust source code
2===============================
David Tolnay35161ff2016-09-03 11:33:15 -07003
David Tolnayac9953b2016-09-07 08:37:12 -07004[![Build Status](https://api.travis-ci.org/dtolnay/syn.svg?branch=master)](https://travis-ci.org/dtolnay/syn)
5[![Latest Version](https://img.shields.io/crates/v/syn.svg)](https://crates.io/crates/syn)
David Tolnay552ff332018-03-31 23:04:48 +02006[![Rust Documentation](https://img.shields.io/badge/api-rustdoc-blue.svg)](https://docs.rs/syn/0.13/syn/)
David Tolnay8659a232018-03-31 22:54:03 +02007[![Rustc Version 1.15+](https://img.shields.io/badge/rustc-1.15+-lightgray.svg)](https://blog.rust-lang.org/2017/02/02/Rust-1.15.html)
David Tolnayac9953b2016-09-07 08:37:12 -07008
David Tolnayc088adb2018-01-01 00:26:05 -05009Syn is a parsing library for parsing a stream of Rust tokens into a syntax tree
10of Rust source code.
David Tolnay35161ff2016-09-03 11:33:15 -070011
David Tolnayc088adb2018-01-01 00:26:05 -050012Currently this library is geared toward the [custom derive] use case but
13contains some APIs that may be useful for Rust procedural macros more generally.
David Tolnayf939f352016-09-11 18:00:09 -070014
David Tolnayc088adb2018-01-01 00:26:05 -050015[custom derive]: https://github.com/rust-lang/rfcs/blob/master/text/1681-macros-1.1.md
David Tolnayf939f352016-09-11 18:00:09 -070016
David Tolnayc088adb2018-01-01 00:26:05 -050017- **Data structures** — Syn provides a complete syntax tree that can represent
18 any valid Rust source code. The syntax tree is rooted at [`syn::File`] which
19 represents a full source file, but there are other entry points that may be
20 useful to procedural macros including [`syn::Item`], [`syn::Expr`] and
21 [`syn::Type`].
David Tolnayfb9f7042016-12-22 12:31:39 -050022
David Tolnayc088adb2018-01-01 00:26:05 -050023- **Custom derives** — Of particular interest to custom derives is
24 [`syn::DeriveInput`] which is any of the three legal input items to a derive
25 macro. An example below shows using this type in a library that can derive
26 implementations of a trait of your own.
27
28- **Parser combinators** — Parsing in Syn is built on a suite of public parser
29 combinator macros that you can use for parsing any token-based syntax you
30 dream up within a `functionlike!(...)` procedural macro. Every syntax tree
31 node defined by Syn is individually parsable and may be used as a building
32 block for custom syntaxes, or you may do it all yourself working from the most
33 primitive tokens.
34
35- **Location information** — Every token parsed by Syn is associated with a
36 `Span` that tracks line and column information back to the source of that
37 token. These spans allow a procedural macro to display detailed error messages
38 pointing to all the right places in the user's code. There is an example of
39 this below.
40
41- **Feature flags** — Functionality is aggressively feature gated so your
42 procedural macros enable only what they need, and do not pay in compile time
43 for all the rest.
44
David Tolnay552ff332018-03-31 23:04:48 +020045[`syn::File`]: https://docs.rs/syn/0.13/syn/struct.File.html
46[`syn::Item`]: https://docs.rs/syn/0.13/syn/enum.Item.html
47[`syn::Expr`]: https://docs.rs/syn/0.13/syn/enum.Expr.html
48[`syn::Type`]: https://docs.rs/syn/0.13/syn/enum.Type.html
49[`syn::DeriveInput`]: https://docs.rs/syn/0.13/syn/struct.DeriveInput.html
David Tolnayc088adb2018-01-01 00:26:05 -050050
51If you get stuck with anything involving procedural macros in Rust I am happy to
52provide help even if the issue is not related to Syn. Please file a ticket in
53this repo.
54
55*Version requirement: Syn supports any compiler version back to Rust's very
56first support for procedural macros in Rust 1.15.0. Some features especially
57around error reporting are only available in newer compilers or on the nightly
58channel.*
59
60## Example of a custom derive
61
62The canonical custom derive using Syn looks like this. We write an ordinary Rust
63function tagged with a `proc_macro_derive` attribute and the name of the trait
64we are deriving. Any time that derive appears in the user's code, the Rust
65compiler passes their data structure as tokens into our macro. We get to execute
66arbitrary Rust code to figure out what to do with those tokens, then hand some
67tokens back to the compiler to compile into the user's crate.
68
69[`TokenStream`]: https://doc.rust-lang.org/proc_macro/struct.TokenStream.html
David Tolnay6c9f5b62016-09-13 15:19:22 -070070
David Tolnay69b538e2016-09-23 19:59:48 -070071```toml
72[dependencies]
David Tolnay552ff332018-03-31 23:04:48 +020073syn = "0.13"
74quote = "0.5"
David Tolnay69b538e2016-09-23 19:59:48 -070075
76[lib]
David Tolnay45cc8492016-10-08 20:52:03 -070077proc-macro = true
David Tolnay69b538e2016-09-23 19:59:48 -070078```
79
White-Oak82d0db72016-09-13 21:45:58 +030080```rust
David Tolnay45cc8492016-10-08 20:52:03 -070081extern crate proc_macro;
David Tolnayb4c63262016-09-23 20:03:06 -070082extern crate syn;
David Tolnay6c9f5b62016-09-13 15:19:22 -070083
White-Oak82d0db72016-09-13 21:45:58 +030084#[macro_use]
85extern crate quote;
White-Oak82d0db72016-09-13 21:45:58 +030086
David Tolnayc088adb2018-01-01 00:26:05 -050087use proc_macro::TokenStream;
88use syn::DeriveInput;
89
David Tolnay45cc8492016-10-08 20:52:03 -070090#[proc_macro_derive(MyMacro)]
David Tolnayb4c63262016-09-23 20:03:06 -070091pub fn my_macro(input: TokenStream) -> TokenStream {
David Tolnayc088adb2018-01-01 00:26:05 -050092 // Parse the input tokens into a syntax tree
93 let input: DeriveInput = syn::parse(input).unwrap();
White-Oak82d0db72016-09-13 21:45:58 +030094
David Tolnay6c9f5b62016-09-13 15:19:22 -070095 // Build the output, possibly using quasi-quotation
96 let expanded = quote! {
97 // ...
98 };
99
David Tolnayc088adb2018-01-01 00:26:05 -0500100 // Hand the output tokens back to the compiler
101 expanded.into()
White-Oak82d0db72016-09-13 21:45:58 +0300102}
103```
David Tolnay6c9f5b62016-09-13 15:19:22 -0700104
David Tolnayc088adb2018-01-01 00:26:05 -0500105The [`heapsize`] example directory shows a complete working Macros 1.1
106implementation of a custom derive. It works on any Rust compiler \>=1.15.0. The
107example derives a `HeapSize` trait which computes an estimate of the amount of
108heap memory owned by a value.
David Tolnayb988b6d2016-10-05 00:12:37 -0700109
David Tolnayc088adb2018-01-01 00:26:05 -0500110[`heapsize`]: examples/heapsize
David Tolnayb988b6d2016-10-05 00:12:37 -0700111
112```rust
David Tolnayc088adb2018-01-01 00:26:05 -0500113pub trait HeapSize {
114 /// Total number of bytes of heap memory owned by `self`.
115 fn heap_size_of_children(&self) -> usize;
David Tolnayb988b6d2016-10-05 00:12:37 -0700116}
117```
118
David Tolnayc088adb2018-01-01 00:26:05 -0500119The custom derive allows users to write `#[derive(HeapSize)]` on data structures
120in their program.
David Tolnayb988b6d2016-10-05 00:12:37 -0700121
122```rust
David Tolnayc088adb2018-01-01 00:26:05 -0500123#[derive(HeapSize)]
124struct Demo<'a, T: ?Sized> {
125 a: Box<T>,
126 b: u8,
127 c: &'a str,
128 d: String,
David Tolnayb988b6d2016-10-05 00:12:37 -0700129}
130```
131
David Tolnayc088adb2018-01-01 00:26:05 -0500132## Spans and error reporting
David Tolnayc2263f32017-03-09 19:20:52 -0800133
David Tolnayc088adb2018-01-01 00:26:05 -0500134The [`heapsize2`] example directory is an extension of the `heapsize` example
135that demonstrates some of the hygiene and error reporting properties of Macros
1362.0. This example currently requires a nightly Rust compiler \>=1.24.0-nightly
137but we are working to stabilize all of the APIs involved.
David Tolnayc2263f32017-03-09 19:20:52 -0800138
David Tolnayc088adb2018-01-01 00:26:05 -0500139[`heapsize2`]: examples/heapsize2
David Tolnay736829a2016-12-22 15:55:53 -0500140
David Tolnayc088adb2018-01-01 00:26:05 -0500141The token-based procedural macro API provides great control over where the
142compiler's error messages are displayed in user code. Consider the error the
143user sees if one of their field types does not implement `HeapSize`.
David Tolnay736829a2016-12-22 15:55:53 -0500144
145```rust
David Tolnayc088adb2018-01-01 00:26:05 -0500146#[derive(HeapSize)]
147struct Broken {
148 ok: String,
149 bad: std::thread::Thread,
David Tolnay736829a2016-12-22 15:55:53 -0500150}
151```
152
David Tolnayc088adb2018-01-01 00:26:05 -0500153In the Macros 1.1 string-based procedural macro world, the resulting error would
154point unhelpfully to the invocation of the derive macro and not to the actual
155problematic field.
156
157```
158error[E0599]: no method named `heap_size_of_children` found for type `std::thread::Thread` in the current scope
159 --> src/main.rs:4:10
160 |
1614 | #[derive(HeapSize)]
162 | ^^^^^^^^
163```
164
165By tracking span information all the way through the expansion of a procedural
166macro as shown in the `heapsize2` example, token-based macros in Syn are able to
167trigger errors that directly pinpoint the source of the problem.
168
169```
170error[E0277]: the trait bound `std::thread::Thread: HeapSize` is not satisfied
171 --> src/main.rs:7:5
172 |
1737 | bad: std::thread::Thread,
174 | ^^^^^^^^^^^^^^^^^^^^^^^^ the trait `HeapSize` is not implemented for `std::thread::Thread`
175```
176
177## Parsing a custom syntax using combinators
178
179The [`lazy-static`] example directory shows the implementation of a
180`functionlike!(...)` procedural macro in which the input tokens are parsed using
181[`nom`]-style parser combinators.
182
183[`lazy-static`]: examples/lazy-static
184[`nom`]: https://github.com/Geal/nom
185
186The example reimplements the popular `lazy_static` crate from crates.io as a
187procedural macro.
188
189```
190lazy_static! {
191 static ref USERNAME: Regex = Regex::new("^[a-z0-9_-]{3,16}$").unwrap();
192}
193```
194
195The implementation shows how to trigger custom warnings and error messages on
196the macro input.
197
198```
199warning: come on, pick a more creative name
200 --> src/main.rs:10:16
201 |
20210 | static ref FOO: String = "lazy_static".to_owned();
203 | ^^^
204```
205
David Tolnay941c0922016-12-22 16:06:27 -0500206## Debugging
207
208When developing a procedural macro it can be helpful to look at what the
209generated code looks like. Use `cargo rustc -- -Zunstable-options
David Tolnayc088adb2018-01-01 00:26:05 -0500210--pretty=expanded` or the [`cargo expand`] subcommand.
211
212[`cargo expand`]: https://github.com/dtolnay/cargo-expand
David Tolnay941c0922016-12-22 16:06:27 -0500213
214To show the expanded code for some crate that uses your procedural macro, run
215`cargo expand` from that crate. To show the expanded code for one of your own
216test cases, run `cargo expand --test the_test_case` where the last argument is
217the name of the test file without the `.rs` extension.
218
David Tolnay3bfbd542017-01-16 14:57:53 -0800219This write-up by Brandon W Maister discusses debugging in more detail:
David Tolnayc088adb2018-01-01 00:26:05 -0500220[Debugging Rust's new Custom Derive system][debugging].
221
222[debugging]: https://quodlibetor.github.io/posts/debugging-rusts-new-custom-derive-system/
David Tolnay3bfbd542017-01-16 14:57:53 -0800223
David Tolnay686f5042016-10-30 19:24:51 -0700224## Optional features
225
226Syn puts a lot of functionality behind optional features in order to optimize
David Tolnayc088adb2018-01-01 00:26:05 -0500227compile time for the most common use cases. The following features are
228available.
David Tolnay686f5042016-10-30 19:24:51 -0700229
David Tolnayc088adb2018-01-01 00:26:05 -0500230- **`derive`** *(enabled by default)* — Data structures for representing the
231 possible input to a custom derive, including structs and enums and types.
232- **`full`** — Data structures for representing the syntax tree of all valid
233 Rust source code, including items and expressions.
234- **`parsing`** *(enabled by default)* — Ability to parse input tokens into a
235 syntax tree node of a chosen type.
236- **`printing`** *(enabled by default)* — Ability to print a syntax tree node as
237 tokens of Rust source code.
238- **`visit`** — Trait for traversing a syntax tree.
239- **`visit-mut`** — Trait for traversing and mutating in place a syntax tree.
240- **`fold`** — Trait for transforming an owned syntax tree.
241- **`clone-impls`** *(enabled by default)* — Clone impls for all syntax tree
242 types.
243- **`extra-traits`** — Debug, Eq, PartialEq, Hash impls for all syntax tree
244 types.
245
246## Nightly features
247
248By default Syn uses the [`proc-macro2`] crate to emulate the nightly compiler's
249procedural macro API in a stable way that works all the way back to Rust 1.15.0.
250This shim makes it possible to write code without regard for whether the current
251compiler version supports the features we use.
252
253[`proc-macro2`]: https://github.com/alexcrichton/proc-macro2
254
255On a nightly compiler, to eliminate the stable shim and use the compiler's
256`proc-macro` directly, add `proc-macro2` to your Cargo.toml and set its
257`"nightly"` feature which bypasses the stable shim.
258
259```toml
260[dependencies]
David Tolnay552ff332018-03-31 23:04:48 +0200261syn = "0.13"
262proc-macro2 = { version = "0.3", features = ["nightly"] }
David Tolnayc088adb2018-01-01 00:26:05 -0500263```
David Tolnayed7a5082016-10-30 20:06:29 -0700264
David Tolnay35161ff2016-09-03 11:33:15 -0700265## License
266
267Licensed under either of
268
269 * Apache License, Version 2.0 ([LICENSE-APACHE](LICENSE-APACHE) or http://www.apache.org/licenses/LICENSE-2.0)
270 * MIT license ([LICENSE-MIT](LICENSE-MIT) or http://opensource.org/licenses/MIT)
271
272at your option.
273
274### Contribution
275
276Unless you explicitly state otherwise, any contribution intentionally submitted
277for inclusion in this crate by you, as defined in the Apache-2.0 license, shall
278be dual licensed as above, without any additional terms or conditions.