Hello everybody,
I merged all four steps of the funnel, in order, using a series of left merges, and saved the result to the variable all_data, as the project asked.
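For reference, the merge chain looks roughly like the sketch below. The DataFrame names cart, checkout, and purchase are assumed from the column names, and the tiny sample values are made up for illustration. They are not the project's real data.

```python
import pandas as pd

# Hypothetical stand-ins for the four funnel tables (made-up values).
visits = pd.DataFrame({
    "user_id": ["a", "b", "c"],
    "visit_time": pd.to_datetime(
        ["2017-01-01 10:00", "2017-01-01 11:00", "2017-01-01 12:00"]
    ),
})
cart = pd.DataFrame({
    "user_id": ["a", "b"],
    "cart_time": pd.to_datetime(["2017-01-01 10:05", "2017-01-01 11:10"]),
})
checkout = pd.DataFrame({
    "user_id": ["a"],
    "checkout_time": pd.to_datetime(["2017-01-01 10:15"]),
})
purchase = pd.DataFrame({
    "user_id": ["a"],
    "purchase_time": pd.to_datetime(["2017-01-01 10:20"]),
})

# Left-merge each funnel step onto the previous result, in order.
# Users who never reached a step get NaT in that step's column.
all_data = (
    visits
    .merge(cart, on="user_id", how="left")
    .merge(checkout, on="user_id", how="left")
    .merge(purchase, on="user_id", how="left")
)
```

In this toy example every user_id appears at most once per table, so all_data keeps the same number of rows as visits.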
If you inspect the tables visits and all_data using .info(), you get the following information:
[table visits]
RangeIndex: 2000 entries, 0 to 1999
Data columns (total 2 columns):
user_id 2000 non-null object
visit_time 2000 non-null datetime64[ns]
dtypes: datetime64[ns](1), object(1)
memory usage: 31.3+ KB
[table all_data]
Int64Index: 2372 entries, 0 to 2371
Data columns (total 5 columns):
user_id 2372 non-null object
visit_time 2372 non-null datetime64[ns]
cart_time 720 non-null datetime64[ns]
checkout_time 598 non-null datetime64[ns]
purchase_time 497 non-null datetime64[ns]
dtypes: datetime64[ns](4), object(1)
memory usage: 111.2+ KB
Why does the table visits have 2000 rows while all_data has 2372 rows?
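One thing that could be checked is whether any of the later tables has more than one row for the same user_id. That this is what happens in the project's data is only a guess, not something confirmed here. What is certain is that a left merge emits one output row per matching right-hand row, so repeated keys on the right can make the result longer than the left table. A minimal sketch with made-up data:

```python
import pandas as pd

# Made-up example: user "a" appears twice in the right-hand table.
visits = pd.DataFrame({
    "user_id": ["a", "b"],
    "visit_time": pd.to_datetime(["2017-01-01 10:00", "2017-01-01 11:00"]),
})
cart = pd.DataFrame({
    "user_id": ["a", "a"],
    "cart_time": pd.to_datetime(["2017-01-01 10:05", "2017-01-01 10:30"]),
})

merged = visits.merge(cart, on="user_id", how="left")

n_visits = len(visits)   # rows in the left table
n_merged = len(merged)   # rows after the left merge

# Count right-hand rows whose user_id already appeared earlier in the table.
n_repeated_keys = int(cart["user_id"].duplicated().sum())

print(n_visits, n_merged, n_repeated_keys)
```

Running the same duplicated() check on the user_id column of each real table would show whether repeated keys are present there.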