Description
What happens?
Hello! I am testing the performance of the read_parquet function. The dataset has 143,997,065 records, the Parquet files total about 29 GB, and every file has 144 columns. In the three cases below I got significantly different results.
To Reproduce
Case 1:
The 29 GB as a single large Parquet file; here is the profiling:
Case 2:
The 29 GB split into 12 Parquet files; here is the profiling:
Case 3:
The 29 GB split into small files of about 128 MB each, roughly 268 files (a sketch of the queries I ran follows below).
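For reference, a minimal sketch of how the three cases could be run in the DuckDB CLI. The file paths, the glob patterns, and the count(*) query are assumptions for illustration, since the original queries and profiling output are not reproduced here:

```sql
-- Case 1: one large Parquet file (hypothetical path)
EXPLAIN ANALYZE SELECT count(*) FROM read_parquet('data/big.parquet');

-- Case 2: the same data split into 12 files (hypothetical glob)
EXPLAIN ANALYZE SELECT count(*) FROM read_parquet('data/split12/*.parquet');

-- Case 3: the same data split into ~268 files of ~128 MB each (hypothetical glob)
EXPLAIN ANALYZE SELECT count(*) FROM read_parquet('data/split128mb/*.parquet');
```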
Comparing the three cases, case 1 performs similarly to case 3, while case 2 is slower than the others.
I noticed that the estimated cardinality (EC) differs between the cases, and the different EC leads to different query plans. Could you tell me how the estimated cardinality (EC) is calculated when running EXPLAIN ANALYZE on a query?
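As a side note on how I looked at the files: the per-file and per-row-group row counts that the Parquet reader sees can be inspected with the built-in parquet_metadata table function. The path below is a placeholder, and the selected column names are assumed to match the 0.9.2 output:

```sql
-- Row-group level metadata, including the number of rows per row group (placeholder path)
SELECT file_name, row_group_id, row_group_num_rows
FROM parquet_metadata('data/split12/*.parquet');
```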
OS:
windows10
DuckDB Version:
0.9.2
DuckDB Client:
duckdb_cli-windows-amd64.zip
Full Name:
Tom
Affiliation:
Strive
Have you tried this on the latest nightly build?
I have tested with a nightly build
Have you tried the steps to reproduce? Do they include all relevant data and configuration? Does the issue you report still appear there?
- Yes, I have