This is a tough question and I'm just looking for help in it.
The questions reads: There are two large data files from different but partnering systems (example: United Airlines & Delta) and asked to create one data file that can be used to create a regression model for analysis. What steps you would take to merge the two data files. How do you ensure the validity of the results? What type of data management would you conduct to ensure the file is ready for analysis?
What I have so far is basically saying that I'd be sure the two partnering systems contain the same variables I would need to create the regression model and color code them in a way that they can still be individually identified as which originated from which system. How do I ensure the validity? And what type of data management would I need?
Please help! Thanks!
The questions reads: There are two large data files from different but partnering systems (example: United Airlines & Delta) and asked to create one data file that can be used to create a regression model for analysis. What steps you would take to merge the two data files. How do you ensure the validity of the results? What type of data management would you conduct to ensure the file is ready for analysis?
What I have so far is basically saying that I'd be sure the two partnering systems contain the same variables I would need to create the regression model and color code them in a way that they can still be individually identified as which originated from which system. How do I ensure the validity? And what type of data management would I need?
Please help! Thanks!