Being A Data Testing Intern At Ellicium Solutions
I was always curious about data testing. For the same reason, I opted for post-graduation in the testing field itself. To make myself more acquainted with testing, I even pursued the international software testing qualification board [ ISTQB] certification, which helped me a lot to get familiar with manual testing concepts.
As Albert Einstein said, “Information is not knowledge the only source of knowledge is experienced”, I was craving for a real industry experience. In September 2017, I got one such opportunity to learn, explore, and experience. I got an internship at Ellicium Solutions!
After joining Ellicium, I went through an extensive training on SQL, which involved understanding complex queries as well as stored procedures. These trainings were fun at the same time way more advanced than what I learned during my college days. I realized the importance of having in-depth knowledge of SQL and why mastering it is a must for a data tester. Even a simple assignment of data analysis was a great learning and gave me a good understanding of how testing is done in an industry.
Learning data testing:
Being a Big Data company, it is essential to ensure data quality and data accuracy. For the same reason, data testing becomes a vital task. Even though I was good at manual testing, data testing was a kind of new thing for me. Training, as well as my team lead Abhilash helped me a lot to understand concepts of data testing. For the first time, I got direct exposure to a variety of datasets. It made me understand complexity at the same time vitality of data testing.
Below are the various aspects of data testing that I have learned till now:
Data Exploration –
As the saying goes, “The devil is in the detail“. Same holds true for data exploration part of data testing. I learned how to check data granularity. For the same reason, I became best friends with this super powerful tool called Microsoft Excel. Data exploration becomes a stepping stone for data analysis. The important take away here is that you need to dedicate a good amount of time doing data exploration, as further data testing depends on how well you have understood the data.
Data Analysis –
While working on one of the datasets, I found that it has data duplicity. With duplicate data, it becomes quite hard to compare data. To solve this issue, you need to merge two or more columns of duplicate data and consider them as a unique key.Against the manual testing, I learned how to save time and efforts. This basic concept was such an essential learning! By the way, one mistake here can cost a lot. It can affect data quality badly.
Data Quality –
I am working on sample data sets related to banking industry. Hence, data completeness and data accuracy are of paramount importance here. As mentioned earlier, my best friend Microsoft excel makes this task easy for me. Learning to ensure data quality was a bumpy ride. I guess you can’t learn this without failing first. Initial training on stored procedure helped me in automating the overall process of data quality assurance.
There are a lot of things in data testing which I am yet to learn. Every day is unique, and every data set gives you a new experience and learning. Thanks to mentors here at Ellicium, I have something to gain even from my failures. I have been a part of many crucial discussions involving the senior-most individuals from Ellicium. Apart from working on tasks, all these things have contributed a lot to make me a better data tester. I am confident that in the coming days will be as exciting as what I have had till now!