Improve how we infer behavior from field types

As brought up in #4600 and #3661, there are cases where it would be nice to support additional types with special semantics for clap_derive fields. However, doing so for any existing isn't backwards compatible, because users might have custom value parsers that produce those types. In addition, currently in order to opt out of special behavior for field types of `Opt` or `Vec`, you need to fully qualify the paths for those types (ex. `::std::vec::Vec`), which is both undocumented, and non-obvious. It would be nice to have a consistent, easy to understand way to specify how 0 or more values for an option are aggregated into a single type.

In particular here are some cases that are currently difficult or impossible to include in a `Parser` derived using clap_derive:
- `Vec<Vec<T>>` containing the values passed in, grouped by occurrence
- `[T; N]`, where `num_args` is set to `N`
- `Vec<[T; N]>` would be a combination of the two above (with `num_args` set to `N`, but action set to `Append`)
- `Result<T, SomeError>` for an optional argument (like `Option<T>`, but return some default error if missing)
- Probably others


The current types with special behavior are described [here](https://docs.rs/clap/latest/clap/_derive/index.html#arg-types). It should also be noted, that if you specify an action of `Count` with an integral field type, also sort of fits in this category.

# Requirements

- Derive code continues to be easy to write and read
- Can both configure the `Arg` and extract the value from `ArgMatches` for our specialized types, including
  - `Option<T>`
  - `Option<Option<T>>`
  - `Vec<T>`
  - `Option<Vec<T>>`
  - `Vec<Vec<T>>` (v5)
  - `Option<Vec<Vec<T>>` (v5)
  - `bool`
  - `()`
  - `[T; N]` (#1682)

# Options

## Add semantics for additional types in a breaking change

This perhaps the most straightforward way to do it. But it has several downsides:

1. It requires breaking changes, which might not be caught by the compiler
1. It adds more "magic" to how types communicate information to the derive macro
1. It increases the importance of being able to opt

## Add an additional argument to the arg attribute

We could add one or more additional arguments to the `arg` attribute on fields.

There are a few ways this could be done:

1. Have a different argument for each kind of semantics, such as `optional`, `all_values`, `optional_values`, `occurrences`, `optional_occurrences`, etc.
2. Add a single new argument that is given an enumeration value that is effectively the same as the current `clap_derive::utils::ty::Ty` enum (although we might want to use more descriptive names)
3. Add a single new argument that is passed an instance of a trait that specifies how to extract the value from the `ArgMatches`, see [User-extensible trait](#user-extensible-trait)
4. Add a single new argument that opts in to treating all supported standard types specially.

1-3 have the downside that either the attribute must agree with the type of the field, or there has to be some magic conversion of the field type.

4 would make it difficult to express sitiutations `Option<Vec<MyType>>` where the value parser parses a `Vec<MyType>`, but the `Option` is because the option is optional. It also still has the backwards
compatibility problem if we add anything new in the future.

### User-extensible trait

For 3 above, we can define a trait that encapsulates the wanted behavior, and give the derive macro an implementation of that trait. While it can be defined for standard types in clap_derive, it would
also be possible to allow users to create their own implementations, opening up additional ways to expand this functionality in other crates.

Note: bikeshed on the actual names

```rust
trait AggregateParser {
  type Aggregate;

  /// Make any necessary changes to the `Arg` before it is used in a `Command`
  fn modify_arg(&self, arg: clap::Arg) -> clap::Arg {
    arg
  }

  fn get(&self, matches: &mut clap::ArgMatches, name: &str) -> Self::Aggregate;
}
```

And one possible implementation would be:

```rust
struct OptionalValues<T>(PhantomData<T>); // goes with `Option<Vec<T>>`
impl<T> AggregateParser for OptionalValues<T> {
  type Aggregate = Option<Vec<T>>;

  fn modify_arg(arg: clap::Arg) -> clap::Arg {
    if arg.is_positional() {
      arg.num_args(1..)
    } else {
      arg
    }
  }

  fn get(matches: &mut clap::ArgMatches, name: &str) -> Self::Aggregate {
    matches.remove_many::<T>(name).map(|v| v.map(Iterator::collect))
  }
}
```

This could potentially use  GATs, so that it isn't necessary to include type parameters to the type itself.

### Expected Example Code

```rust
// 1. different arguments
#[arg(long, optional)]
a: Option<u32>,
#[arg(long, flag)]
b: bool,
#[arg(long, occurrences)]
c: Vec<Vec<String>>,
#[arg(long, optional_option)]
d: Option<Option<String>>,
// etc.

// 2. enumeration
#[arg(long, aggregation = AggregationType::Optional)]
a: Option<u32>,
#[arg(long, aggregation = AggregationType::Flag)]
b: bool,
#[arg(long, aggregation = AggregationType::Occurrences)]
c: Vec<Vec<String>>,
// etc.

// 3. trait
#[arg(long, aggregation = Optional<u32>)]
a: Option<u32>,
#[arg(long, aggregation = Flag)]
b: bool,
#[arg(long, aggregation = Occurrences<String>)]
c: Vec<Vec<String>>,

// 4. single arg
#[arg(long, magic_types)]
a: Vec<Vec<String>>
```


## Support parsing a macro invocation inside the struct definition

This might look like:

- `optional!(T)` is equivalent to the current `Option<T>`
- `values!(T)` is equivalent to the current `Vec<T>`
- `flag!()` is equivalent to the current `bool`
- `occurrences!(T)` would expand to `Vec<Vec<T>>` and opt in to getting a vec of values for each occurrence.
- either have `optional_vec!(T)` and `optional_occurrences!(T)` or allow `optional!(values!(T))` and `optional!(occurrences!(T))`.
- Maybe for completeness also have `required!(T)` which is the same as T but would make the option required?

These all use the Value type `T`, but it could also use the final type instead (for example: `occurrences!(Vec<Vec<T>>)`)

### Custom macros

This method could potentially support custom behavior by delegating to actual macro invocations in the generated impl. If the macro used looked like `example!(T)` then the generated code would have the following:

- When implementing `CommandFactory` invoking the macro like: `example!(T, arg: arg_value)` where `arg_value` is the `Arg` for the field, and it returns a new `Arg` to actually use, so that it can make modifications to the argument.
- The type of the field in the struct itself would be `example!(T)`
- When implementing `FromArgMatches`, to get the value of the field, invoke `example!(T, matches: matches_val, name)` where `matches_val` is an `ArgMatches` and `name` is the name of the option to extract.


Note the similarity between this and the custom trait in the section on using an attribute

### Expected Example Code

```rust
#[arg(long)]
a: optional!(u32),
#[arg(long)]
b: flag!(),
#[arg(long)]
c:  occurrences!(String),
#[arg(long, optional_option)]
d: optional_option!(String),
```

## Use newtypes

Wrapper types might be the least confusing, since they would be clearly intended for the purpose of communicating how the field should be parsed. However, they are probably the least ergonomic to use, since you would often have to unwrap the value. Although... I'm not sure how to determine for sure that the path actually points to the right type, and not a user-defined type with the same name, without require the type to be fully specified.

From @epage:

> One option would be to use deref specialization. We could move some (or all?) of the type handling to a macro that takes in a field's type and uses deref specialization to find the right Arg policy type which would have a function to apply that policy to an Arg


Another possibility is we could have the newtypes implement a trait similar to the one defined above in [User-extensible trait](#user-extensible-trait) (but with `Self` instead of `Self::Aggregate`) for the newtypes. Tha could also potentially make it user-extensible, but we still have the question of how to identify if a type is one of these types and extract the type for the value parser. Perhaps it could be combined witht he autoref specialized macro @epage mentioned above? Or use an attribute. Probably if it is user-extensible we would require that it has 0 or 1 type parameters, and if it is 1, use that to infer the type for th value parser. Or maybe we could use an associated type?

### Expected Example Code

```rust
#[arg(long)]
a: Optional<u32>,
#[arg(long)]
b: Flag,
#[arg(long, occurrences)]
c: Occurrences<String>,
#[arg(long)]
d: OptionalOption<String>,
```


Footnote:
I haven't put a lot of thought into the specific names for types, macros, attribute arguments etc. We can hash out better names for those once we decide on the general direction.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Improve how we infer behavior from field types #4626

Requirements

Options

Add semantics for additional types in a breaking change

Add an additional argument to the arg attribute

User-extensible trait

Expected Example Code

Support parsing a macro invocation inside the struct definition

Custom macros

Expected Example Code

Use newtypes

Expected Example Code

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Uh oh!

Improve how we infer behavior from field types #4626

Description

Requirements

Options

Add semantics for additional types in a breaking change

Add an additional argument to the arg attribute

User-extensible trait

Expected Example Code

Support parsing a macro invocation inside the struct definition

Custom macros

Expected Example Code

Use newtypes

Expected Example Code

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions