Questionnaire for ETL Requirements Gathering and Analysis
Review Reference No.:
Review Date:
Review Reference Documents:
Sl. No.    Questionnaire    Response

General Questions
1. What is the primary business requirement of this system?
2. Who are the business groups/users of the system?
3. Is any strategy in place to handle incremental data and slowly changing dimensions (SCD)?
4. What is the projected growth of the data warehouse (DWH)?
5. Please specify any 'out of scope' requirements.
6. Please provide client contacts for any clarifications.

Questions on Existing System
7. Please explain the current process/methodology followed in the existing system.
8. Please explain the current ETL architecture, with the breakup of development, QA and production servers.
9. Is the system fully automated, or is any kind of manual intervention required (e.g. during extraction or data load)? How about the new system?
10. Is any documentation available for the existing system? If so, please provide access to it.
11. Has any project prototyping been done? If yes, please give details.
12. Are there any problems with mappings and their resolution, or any architectural challenges?
13. Are there any known data quality issues?
14. Are there any issues/bottlenecks related to the ETL process?
15. Please indicate the number of existing Informatica mappings.
16. Please indicate the complexity distribution of the current ETL mappings.
17. Please indicate whether the current ETL jobs pull data from the source applications or whether data is pushed into Informatica.
18. What is the approximate volume of data in the database?
19. What batch load window is being used today?
20. What is the database system used?

Questions on Estimation
21. Do aggregate tables need to be created? If yes, how many, and for which subject areas?
22. How many ETL routines/mappings are required? Please classify them by complexity as simple, medium and complex, and provide the definition of each class.
23. Please provide the source system details: name, platform, description.
24. Are any data sharing agreements with source data owners needed?
25. Staging area:
    • What is the design of the staging area?
    • How much data is retained in the staging area?
26. Are there any
    • Aggregations
    • Calculations
    • Denormalizations
    • Business rules
    to be applied in the ETL transformations? If yes, please provide details.
27. What is the type of extraction: full or incremental? If incremental, how do you identify which data has changed? Are you using any specific tool for this?
28. What is the volume of incremental data?
29. What loading mechanism is to be used: bulk load or update-insert?
30. What is the ETL schedule (daily/weekly/monthly), and what is the scheduling process?
31. Are there any performance constraints, such as a time window for data extraction, transformation or loading?
32. What should be the strategy for ETL monitoring processes?
    • Error handling
    • Exception handling
    • Level of logging
    • Notification process
33. What is the security architecture of the application?
34. Is the security at the application level, report level or data level?
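
Note on question 27: one common way to identify changed data for an incremental extraction is a high-watermark comparison on a last-modified timestamp. The sketch below is only an illustration of that idea, not a description of any particular system. The `orders` table, its `updated_at` column, the sample rows and the function name `extract_changed_rows` are all hypothetical, and an in-memory SQLite database stands in for the source system.

```python
import sqlite3


def extract_changed_rows(conn, last_watermark):
    """Return rows modified after last_watermark, plus the new watermark.

    Assumes a hypothetical source table `orders` whose `updated_at` column
    holds ISO-8601 timestamp strings, which sort correctly as text.
    """
    rows = conn.execute(
        "SELECT id, amount, updated_at FROM orders "
        "WHERE updated_at > ? ORDER BY updated_at",
        (last_watermark,),
    ).fetchall()
    # Advance the watermark only when at least one row changed.
    new_watermark = rows[-1][2] if rows else last_watermark
    return rows, new_watermark


# Demo against an in-memory stand-in for the source system (made-up data).
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE orders (id INTEGER PRIMARY KEY, amount REAL, updated_at TEXT)"
)
conn.executemany(
    "INSERT INTO orders VALUES (?, ?, ?)",
    [
        (1, 10.0, "2024-01-01T00:00:00"),
        (2, 25.5, "2024-01-02T08:30:00"),
        (3, 7.25, "2024-01-03T12:00:00"),
    ],
)

changed, watermark = extract_changed_rows(conn, "2024-01-01T12:00:00")
print([row[0] for row in changed], watermark)
```

A timestamp watermark of this kind does not detect rows that were physically deleted at the source, which is one reason the questionnaire also asks whether a specific change-detection tool is in use.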