LINQ Mastery

Distinct & DistinctBy: Mastering unique sets

1 Views Updated 5/4/2026

Removing Duplicates

Redundant data is a waste of processing power. Distinct and DistinctBy are your tools for ensuring every record in your stream is unique.

1. Distinct (Primitive and Reference)

For primitives (int, string), Distinct() works out of the box. For complex objects, it uses Reference Equality. **Architect Note:** If you want Distinct() to work on objects by value, you MUST implement IEquatable or provide a custom IEqualityComparer.

2. DistinctBy (Added in .NET 6+)

The modern way to filter unique objects based on a specific property. No more custom comparers needed!


// Get unique users based ONLY on their Email
var uniqueUsers = users.DistinctBy(u => u.Email);
    

3. Architect Insight

Q: "Is Distinct expensive?"

Architect Answer: "Yes. Under the hood, Distinct builds a internal **HashSet**. This has an O(N) memory cost. For massive streams, consider using a database-level DISTINCT or GROUP BY to ensure the uniqueness is handled at the storage layer before it flows into your application."

LINQ Mastery
General
Introduction to LINQ Mastery
1. Core Foundations
LINQ Fundamentals: Why LINQ? IQueryable vs IEnumerable: The Architect's choice Expression Trees: The power behind LINQ providers Method Syntax vs Query Syntax: Trade-offs
2. Filtering & Transformation
Where & Select: The bread and butter SelectMany: Flattening complex hierarchies OfType vs Cast: Handling heterogeneous collections Distinct & DistinctBy: Mastering unique sets
3. Aggregation & Quantifiers
Any, All, Contains: The boolean quantifiers Count, LongCount, Sum: Basic aggregations Min, Max, Average: Statistical operations Aggregate: The 'Fold' function of .NET
4. Ordering & Partitioning
OrderBy & OrderByDescending: Sorting data ThenBy: Multi-level sorting Take & Skip: Pagination strategies TakeWhile & SkipWhile: Dynamic partitioning
5. Sets & Lookups
Union, Intersect, Except: Set theory in C# Zip: Combining two streams ToDictionary vs ToLookup: One-to-One vs One-to-Many Chunk: Slicing data for batch processing
6. Join & Grouping
Inner Join: The standard match GroupJoin: Creating hierarchical results GroupBy: The SQL counterpart in LINQ Left Outer Join: The manual workaround in LINQ
7. Advanced Providers & Parallelism
PLINQ (Parallel LINQ): Speeding up CPU-bound queries AsParallel vs AsSequential: When to switch LINQ to XML: Processing documents with ease Custom LINQ Providers: How to build your own 'Queryable'
8. Real-world Performance & Patterns
Memory Leaks in LINQ: Capturing variables and closures Architect Case Study: Optimizing a multi-join dashboard query