TrainingConferencesAbout UsContact UsAdvertiseSQE.comRSS Feed

StickyMinds.com: brain food for building better software

Log In
 Clarify Your Search Criteria

Tips on Using Our Search Feature(s)
 
StickyMinds.com Home
ResourcesTopicsCommunityPowerPassBlogs
Home  >  Detail: Keeping Secrets



A StickyMinds.com Original
Article Picture
Keeping Secrets
How Data Privacy Affects Testing

By Linda Hayes

Send This Content to a FriendGet a Short Link to This ContentPrint This ContentSee User Comments About This Content

Summary: Test data has long been a challenge for testing; privacy legislation, identify theft, and the continued trend towards outsourcing has made it even worse. Just establishing and maintaining a comprehensive test environment can take half or more of all testing time and effort. In this week's column, Linda Hayes adds in the new and expanding privacy laws that inevitably limit your testing options. Yet from the quagmire of laws and company standards, better testing can emerge.


TechExcel, Inc.
In the old days, production could provide a refresh from time to time for your test bed. Although this was not easy, it was a starting point. You still had to make sure you either had the storage to get a full copy or could extract a coherent subset, and of course you could not reuse data because it was a moving target. And even though production was arguably a good sample, there was no guarantee that all your potential test conditions existed. Finding the right conditions to satisfy a test was the proverbial needle in a haystack.  
 
This was already hard, so what do you have to worry about now? 
 
Your Health  
For starters, when you visit the doctor you have yet more paperwork to complete, authorizing (or not) the disclosure of your medical information, thanks to the Health Insurance Portability and Accountability Act of 1996. The rules set by this act highly restrict who can access patient data, as well as when and why. Since your medical provider has to have permission to disclose your data to other health professionals or family members, it can't be provided to perfect strangers who are testing the latest release of software for a health insurance company or hospital management firm. 
 
As a tester, if your application touches this information in any way, you can't use unconditioned production data for testing. You can scrub names or social security numbers, which is not a necessarily new technique (although it's not always done). But now you have to worry about less obvious elements that can give the patient away, for example, a policy or claim number can tie back to an actual person through another system.  
 
Your Money 
Grimm-Leach-Bliley may not be as well known as Sarbanes-Oxley, but Title V of that act directly addresses the privacy of your financial information. Aside from certain exceptions like the Fair Credit Reporting Act, your financial institution has to publish its privacy policy and allow you to opt-out of certain disclosures. They also can't provide your information to testers in the U.S. or elsewhere. The European Union has even more stringent financial privacy regulations. 
 
That means any software that touches money or where it lives also has to be kept private. Again, not just your social security number, but other data that could be traced back to you even indirectly. Bank or credit card numbers are obvious, but what if an order number could be tied back to an invoice that could identify you? 
 
Your Identity 
All of this comes back in some way or another to your identity. Identify theft is a huge problem. Consumers and credit card companies are getting smarter and Web-exposed systems are becoming more secure, but all of the software that powers these relationships has to be tested, which means testers may have access to data that they otherwise would never be authorized to see. 
 
I recall one project where we tested upgrades to a human resource system. We had to battle with the system administrator to give us supervisory password access to the test server so we could theoretically hire and fire employees, change their salaries, etc. The network support group resisted because the information was highly confidential, but we argued that the testing region was different from production. As a practical matter, we had to exercise every available function. As a compromise, we received regular refreshes from production, which we had to scrub.  
 
Your Location 
Granted, legislation is catching up and law enforcement is waking up, so protection is becoming a priority...at least in some places. But what if testing is done in another country where these mechanisms aren't as available or enforceable? If your company sends its testing to another firm in another country, then you're entering a whole new level of disclosure. While your bank technically hasn't disclosed data to another institution if the data finds its way into an internal test region, sending it across the world to an unrelated entity is completely different. 
 
Hidden complications can also come in the form of organizational constraints meant to provide additional protection, as usually found within domestic firms. "Chinese walls" and other internal controls limit the exchange of information between departments or companies. But what if you are using a gigantic outsourcing firm that services hundreds of systems for multiple enterprises? It is conceivable that data not meant to be commingled could be. 
 
Silver Lining 
As an eternal optimist, I do see a silver lining in all of this. It is forcing companies to examine the entire issue around test data. A customer recently described a major project designed to create a test data region from the ground up. Not only did this solve privacy concerns, it also defined the data needed and reused every time, improving coverage and saving tremendous time. 
 
If your company hasn't come to grips with privacy issues yet, I encourage you to do so and fast. It isn't easy to fix, but once the genie is out of the bottle, it may be impossible.


About the Author
Linda G. Hayes, B.B.A., C.P.A., M.S., J.D. has twenty years of experience in software development, is a frequently published author and highly rated speaker on software quality and test automation. As co-founder of AutoTester, Inc., a leading automated testing software vendor, she pioneered structured software test automation. Her article on integrating automated testing throughout the software development cycle won the Most Significant Contribution of the Year award from the Quality Assurance Institute and was published by Auerbach in the testing chapter of their Systems Development Handbook. Linda can be reached at linda@worksoft.com.

Back to Top
 
 

Member Comments
Add Your CommentExpand Comments
 
Comment:    
by Kathy Van Stone 5/4/2005

I am admittedly a developer more than a tester, but for seed data we tend to use obvious fake data (for example 000-11-2222 for a social security number). We do have a method of scrubbing production data for when there is a error that needs debugging. I handed your article to the person in charge of that to see if he may have missed something. Thanks for the article.

Author's Response:
5/9/2005    
Hi Kathy - you are most welcome, thanks for taking the time to comment!

 
 
Comment:    
by Jeremy Sloan 5/4/2005

As a software QA department for a financial company, data privacy is an issue that we have to deal with all the time. Test automation can really help, in some cases. When you have the ability to load a created test bed entirely from a set of Excel worksheets or delimited text files, privacy concerns disappear and you also have far more control over your test conditions. Building a comprehensive suite of test data from scratch is a lot of work, though. Even with automated testing tools available, it's not always practical (or even possible) to avoid using production data for testing - especially for production support. In order to...Read On

Author's Response:
5/9/2005    
Hi Jeremy - agreed that it's rarely practical to start from a blank state, but if you do start with production then you might consider using some of the data scrubbing or "de-personalization" tools out there that will modify selected fields in a file or database so that the critical values within the real data is at least obscured.

 
 
Comment:    
by Michael McDonald 12/28/2004

I don't work for a bank or a healthcare company. Where do I go to find out what rules on data privacy apply to us?

Author's Response:
12/29/2004    
Hi Michael - check with your industry association, they usually track this type of legislation that affects their constituents.

 
 
Comment:    
by Frits Bos 12/27/2004

Hi, Linda, I like the way you put the question squarely to test managers, why on earth would you continue to rely on using production file extracts when that is a fundamentally flawed approach, let alone an approach that is likely to turn you into a law-breaker? I remember the IT history a bit farther back than you acknowledge. Having been around for a few years, as you know, I can attest to the fact that not only was finding the right test conditions difficult, typically the data did not exist. That was actually a blessing in disguise, because we did not have an opportunity to compromise customer data. So the “old days”...Read On

Author's Response:
12/29/2004    
Thanks for such a thoughtful and detailed response, Frits. I agree that this is not a new problem and it is amazing that it is taking legislation to make us do something we should have done for our own reasons: complete test coverage.

 
Back to Top



 
Ads By Google
What's This?
 
 



Home   |   Resources   |   Topics   |   Community   |   PowerPass



© 2010 StickyMinds.com. All rights reserved.
StickyMinds.com is a division of Software Quality Engineering.
Privacy Policy    Terms & Conditions    Link to StickyMinds.com    Feedback


TechExcel, Inc.




STAREAST 


Better Software Conference