AAVSO HOME > news
 
 

News
Current News Archives
2006 Archive
2005 Archive
2004 Archive
2003 Archive
2002 Archive
2001 Archive
2000 Archive
1999 Archive
1998 Archive
 
Main sections of web
The AAVSO
Variable Stars
Observing
Access Data
Publications
Support
Education and Outreach
 
Pick a star

Create a light curve
Recent Observations
Find charts     
VSX
 

Data Validation: What is it?

Simply put, data validation is the process by which AAVSO Headquarters staff scrutinizes observations in the AAVSO International Database for any possible errors.

The purpose of data validation is not to produce a pretty light curve devoid of scatter, but rather to ensure that no unintentional errors creep their way into the database. Such errors include: designation/name discrepancies; comment field code and observer initial oversights; and JD or magnitude problems. Fortunately, many of these types of errors are no longer encountered since computer software has become much more sophisticated than it was in the early days of computerization.

AAVSO Data
As the largest and highest quality digital database of variable star observations available, thousands of researchers, educators, and students have used the wealth of information contained in the database for both personal and professional projects. As an impressive testament to the work and dedication of its observers, the AAVSO fulfills thousands of requests for data annually. While the AAVSO maintains a strict quality-control policy to ensure error-free data for such requests, there had never been a systematic survey of all the data to look for potential and obvious errors, and to investigate and rectify any problems. That is, until the Data Validation Project.

The Data Validation Project

Data Validation Project

In 2002, NASA awarded the AAVSO a 2-year grant to error check or validate observations for 4,922 stars in the AAVSO International Database from the founding of the organization in 1911 through 2001. The stars chosen for validation were of many types, with the heaviest concentrations coming from the pulsating (66%) and the eruptive-type (23%) variables. Eclipsing binary, RR Lyrae, and comparison star data, as well as select stars with complex histories or light curves, were not included at this time. Nonetheless, the project encompassed the validation of over 10 million observations!

A very important part of the Data Validation Project was to find, investigate, and resolve discrepant observations, mostly by comparison with the observer's submitted report. In fact, out of the observations validated, 633,126 were considered to be discrepant and 441,879 (70%) of those were repaired by changing data fields to match the observer's original reports.

Sara looks up data points
Technical assistant, Sara Beck, wades through the sea of observer reports at HQ to look for an original paper report.
Validating AAVSO Data
Technical assistants, Kerri Malatesta, Gamze Menali, and Mike Saladyga discuss data validation.

If a suspicious data point could not be resolved, it then became subject to a strict set of rules agreed upon by the Director and the Technical Staff. During this phase, the Technical Staff scrutinized the data by looking at each individual light curve included in the project. Points that would negatively affect the analysis of data to a statistically significant level were flagged discordant, as represented by an editorial letter code inserted into the digital record. None of the data deemed discrepant were ever deleted from the permanent archives. The data remain in the AAVSO International Database and are available upon request.

Mission Complete!

The Data Validation Project was completed in September of 2004, meeting the 2-year deadline. By the close of the project and with 9,324 staff hours logged, over 10 million observations contributed by over 6,000 observers worldwide were made available via the AAVSO web site. Since that time, well over 4,000 downloads of validated data have been made. Not only has this project significantly cut down on the amount of data requests received at HQ, but it has also given users immediate access to the treasures contained in the AAVSO International Database!

Light curve before and after
R Trianguli light curve sample before and after validation data correction-questionable data points are circled. Click image to see complete figure

The validated data for each star may be viewed graphically online through the AAVSO Light Curve Generator or may be downloaded as a data file . Interested observers may wish to make use of the data usage report program which sends an e-mailed monthly log stating the number of times any of their data was accessed via the online data download tool.

Data Validation Continues

Kerri reviews the data
Technical assistant, Kerri Malatesta, reviews one of thousands of light curves contributed to by AAVSO observers.
As a follow-up to the original Data Validation Project, work is currently underway to fill in the gap of unvalidated data from the initial project cutoff date (December 31, 2001) to the most recent month of archived data, making the dataset for a given star as complete and up-to-date as possible. Presently, stars included in this follow-up project are referred to as "priority stars." Such stars are those for which the AAVSO receives frequent data requests, stars that are easy to observe, stars that are contained in the AAVSO Bulletin, as well as other select new and interesting variables. Updates for these stars (over 1,000 in total) occur on a monthly basis.

Future plans also include the validation of stars not included in the original project, such as Eclipsing Binary, RR Lyrae, suspicious comparison stars, and more.

How You Can Help

The ultimate purpose of data validation is to make sure that an observer's data points — your data points — are recorded accurately. Although there are many checks and balances that take place prior to adding data to the archives, some errors, such as JD and magnitude may still make it through.

You can help with the data validation process by doing your own, smaller scale error investigation prior to submitting your observations to the AAVSO. Before sending off your report, quickly check to see if the JD is correct and that you haven't skipped ahead or behind by a month or even a year, that your 14.2 magnitude isn't inadvertently being reported as 4.2, that the CCDV magnitude that you have recorded isn't actually a CCDR, and so forth.

Together, we can continue to make the AAVSO International Database live up to its excellent, high quality reputation!

 
  search engine |  site map |  links |  contact us