Hi fellow students and experts,
I have a question about the Biodiversity project that I’m currently working on.
There’s a column called ‘scientific_name’, and using the nunique method I found that it has 5541 unique values, which is fewer than the total number of rows: 5824. This means that the column contains duplicates, but scientific names are supposed to be unique, so I dug deeper to see what the duplicates are. I used .duplicated() to find the duplicated rows and printed them, but I don’t see any duplicates. Can anyone explain what is going on? You can find my repo below. Thanks!!!
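
To make the check concrete, here is a minimal sketch on made-up data. The DataFrame name `species`, its columns other than scientific_name, and all of its rows are hypothetical and not taken from the project. It assumes pandas and shows one possible reason the printout can look duplicate-free: calling .duplicated() on the whole DataFrame compares entire rows, and with the default keep="first" only the later copies are flagged, so each flagged row appears once.

```python
import pandas as pd

# Hypothetical toy data standing in for the project's table (not the real data).
species = pd.DataFrame({
    "category": ["Mammal", "Bird", "Mammal", "Bird"],
    "scientific_name": ["Canis lupus", "Columba livia", "Canis lupus", "Gavia immer"],
    "conservation_status": [None, None, "Endangered", None],
})

n_rows = len(species)                            # total number of rows
n_unique = species["scientific_name"].nunique()  # distinct scientific names

# Whole-row comparison: rows that differ in any column are not duplicates.
full_row_dupes = species[species.duplicated()]

# Compare on scientific_name only; the default keep="first" flags only
# the second and later occurrences of each name.
later_only = species[species.duplicated(subset="scientific_name")]

# keep=False flags every occurrence, so repeated names show up side by side.
all_copies = species[
    species.duplicated(subset="scientific_name", keep=False)
].sort_values("scientific_name")

print(n_rows, n_unique)
print(all_copies)
```

If the real data behaves like this toy table, sorting the keep=False result by scientific_name should put the repeated names next to each other, which makes it easier to see which other columns differ between the copies.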