Proposal: "Closed" type hierarchies

# Background

This proposal is substantially similar to https://github.com/dotnet/roslyn/issues/188, but it focuses more on integrating with existing C# object structures and less on adding new "ADT"-style objects. The goal, however, is still very similar. By adding closed type hierarchies the compiler can be guaranteed that all the types its sees are the only subclasses of the target type. This can integrate with features like pattern matching to provide a warning when not all possible types are checked in a pattern match.
# Problem

Providing an ADT-like pattern is helpful for the pattern matching case but it unfortunately doesn't integrate well with current object-oriented patterns written in C#. For example, the Roslyn [TypeSymbol](http://source.roslyn.io/#Microsoft.CodeAnalysis.CSharp/Symbols/TypeSymbol.cs) type could be useful to be matched on, but the design as proposed in #188 would require rewriting or significantly modifying the existing type structure meet that goal. It would be better if, instead, the existing object hierarchy could be easily marked as `closed`, meaning that all inheriting classes must be in the same compilation unit as the definition.
# Solution

The cleanest solution would simply be to add support for a new modifier, `closed`, that would require all inheriting classes to be in the same compilation unit as the declaration and to be themselves marked as either `closed` or `sealed`. Interestingly, this can be easily extended for interfaces as well, where `closed` would imply that the implementing type would have to exist in the same compilation unit. This would encompass the previously proposed `InternalImplementationOnly` attribute without adding any new concepts to the language.

The main issue with this simple addition is that it would only be enforced in the compiler, probably by the presence of a custom attribute in metadata. This would mean that any language which doesn't understand `closed`, or even any previous version of the C# language, would not be under any constraints in implementing these types. Unfortunately, this causes significant issues with any language using `closed` as it would generally be seen to be a guarantee of the possible subtypes or implementations, when in fact any non-participating language would violate those assumptions.

However, attempting to encode the objects in such a way as to use the .NET object system to restrict usage has an unfortunate set of problems that I'll detail here.
## Issues with using .NET to encode `closed`

The first barrier is that .NET provides no method of restricting interface implementation, so applying `closed` to interfaces is impossible.

The second barrier is that .NET only provides a way of restricting inheritance of objects by limiting the visibility of a constructor. If an object is visible outside its declaring assembly, all the constructs must be marked `internal` or stricter in order to prevent inheritance outside the declaring compilation. If `InternalsVisibleTo` is in play, things become even more complicated -- the only way to prevent inheritance is to mark all constructors private and require that all classes inheriting from the `closed` class be inner classes. Assuming we don't want to reify that encoding into the language, this would require lowering definitions and usages to the appropriate class layout.

Requiring all classes to be `sealed` or only constructed inside the declaring compilation is a heavy restriction, but that's not the only limit. There's at least one other restriction in the C# type system: classes which do not have public default constructors cannot be used as parameters to a type parameter with the constraint `T: new()`. If we go down this path, there may be other complications in the same vein.

Now that I've listed the issues with using the type system, I'd like to discuss why not using the type system isn't as bad as it first seems.
## Failure scenarios

In this instance, "failure" is defined as a language which supports and understands the `closed` modifier encountering a type which should have been prohibited by the existence of `closed`. There are only a few instances where this is the case:
1. `Compilation A` defines `type A`, which is marked as `closed`. `Compilation B`, which is written using a language which does not understand `closed`, creates `subtype B`, which inherits from `type A`. This type is then passed back into `compilation A` via a public API. In this instance, we can see the `compilation B` is interacting with `compilation A` as both a consumer and a provider through its public API. The violation of `compilation A`'s public contract is the underlying cause of the failure, regardless of the support or lack thereof from the language. Many public APIs have contracts which cannot be expressed in the type system, like various argument out of range conditions, but this is not often seen as an unredeemable flaw in the language design. In addition, like out of range conditions, `closed` type violations could be checked at runtime at public API boundaries to provide early feedback and failure for misuse of the API.
2. `Compilation A` defines `type A`, which is marked as `closed`. `Compilation B`, which is written using a language which does not understand `closed`, creates `subtype B`, which inherits from `type A`. `Compilation C`, which supports `closed`, references `compilation A` and is referenced by `compilation B`. `Type B` is passed from `compilation B` to `compilation C`, where it proceeds to fail inside `compilation C`. Unfortunately, in this case the failure in `compilation C` is more of a violation of the contract of `compilation A` rather than `compilation C`, so efforts to shield against failures in `compilation C` is difficult and, in a sense, inappropriate.
3. Other failures are transitive failures of the above, for example a tertiary compilation passing a "bad" value into a compilation, which then passes it through to a failing compilation.

(1) is the case I believe is the least complicated and would be the most common. I also believe it is the case least worth worrying about. In addition to being similar to other misuses of a public API, the failure is caused directly by a compilation creating an invalid value based on a referenced API, then passing that invalid value back into the API. In this case 1) the constructor of the bad value should have been aware of the public contract of the referenced assembly because it is both consuming, augmenting, and utilizing it directly, and 2) the referenced API has the greatest ability to be "defensive" in its consumption.

The case in (2) is a bit more complex, both because the contract may be more implicit and the failing compilation may be less able to protect itself. However, that doesn't mean that the risk is completely unlimited. For one, compilations can limit their public API to not rely on types marked as closed. Also, the situation becomes more pathological as the compilation references and API contracts become more complicated, but they also become rarer. Overall, it seems that this failure mode is worth the significant benefit it provides.
# Conclusion

There are no trivial wins here, but it appears that a simple custom-attribute-enforced `closed` type modifier would provide substantial benefit that may outweigh the downsides. The only thing that seems clear is that attempting to provide a .NET-object guarantee with arbitrary closed hierarchies can get very complicated very quickly and is probably not worth the complexity and limitations it brings.


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Proposal: "Closed" type hierarchies #8729

Background

Problem

Solution

Issues with using .NET to encode `closed`

Failure scenarios

Conclusion

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Proposal: "Closed" type hierarchies #8729

Description

Background

Problem

Solution

Issues with using .NET to encode closed

Failure scenarios

Conclusion

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions

Issues with using .NET to encode `closed`