Ideally, I want to end the test once there is a statistically significant difference, using an alpha level of 1% and a power of 90%.

However, this will be my first time setting a power criterion for a test. To calculate the power, I was going to use the observed mean of each sample.
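To make the idea concrete, here is a rough sketch of the calculation I have in mind. It assumes a two-sided, two-sample comparison of means with equal group sizes and uses a normal approximation rather than an exact t-based calculation. The function name `approx_power` and its parameters are my own placeholders, and the numbers in the usage lines are made up for illustration, not from my data.

```python
from math import sqrt
from statistics import NormalDist


def approx_power(mean_a, mean_b, pooled_sd, n_per_group, alpha=0.01):
    """Approximate power of a two-sided, two-sample z-test.

    Assumes equal group sizes and a common standard deviation
    (normal approximation; hypothetical helper, not a library function).
    """
    z = NormalDist()
    # Critical value for a two-sided test at the given alpha
    z_crit = z.inv_cdf(1 - alpha / 2)
    # Standardized difference in means, scaled by the per-group sample size
    ncp = abs(mean_a - mean_b) / pooled_sd * sqrt(n_per_group / 2)
    # Probability of rejecting in either tail under the assumed difference
    return z.cdf(ncp - z_crit) + z.cdf(-ncp - z_crit)


# Placeholder inputs: means 10.0 vs 10.5, pooled SD 1.0
for n in (50, 100, 120, 150):
    print(n, round(approx_power(10.0, 10.5, 1.0, n), 3))
```

In this sketch the means plugged in are the ones observed in the samples, which is exactly the step I am unsure about.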

Is this going in the right direction? Or is there some huge flaw in my reasoning?