We know that when flipping a coin it is very rare, but it can happen, that we get 10 heads or 10 tails in a row.

So we definitely need a much larger sample. My guess is at least a few hundred flips to get the margin of error down to a few percent.
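To put rough numbers on that guess, here is a small sketch. It assumes a fair coin, independent flips, and the usual normal approximation at 95% confidence (z = 1.96). The function names `prob_all_same` and `margin_of_error` are just made up for this illustration.

```python
import math


def prob_all_same(n_flips: int) -> float:
    """Chance that n fair, independent flips are all heads or all tails."""
    # Two favourable sequences (all heads, all tails) out of 2**n_flips.
    return 2 * 0.5 ** n_flips


def margin_of_error(n_flips: int, p: float = 0.5, z: float = 1.96) -> float:
    """Approximate margin of error for an observed proportion.

    Uses the normal approximation z * sqrt(p * (1 - p) / n).
    p = 0.5 is the worst case; z = 1.96 corresponds to about 95% confidence.
    """
    return z * math.sqrt(p * (1 - p) / n_flips)


if __name__ == "__main__":
    print(f"10 identical flips in a row: {prob_all_same(10):.4%}")
    for n in (10, 100, 400, 1000):
        print(f"{n:>5} flips: margin of error ~ {margin_of_error(n):.1%}")
```

Under those assumptions, 10 identical flips in a row has a chance of 1 in 512, and the margin of error is about 4.9% at 400 flips and about 3.1% at 1,000 flips, so "a few hundred" is in the right ballpark if "a few percent" means around 5%.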

