File name: ETL Architect Interview Questions and Answers
  Category: C
  Development tool:
  File size: 91 KB
  Downloads: 0
  Upload date: 2009-10-11
  Provider: ljw_*****
 Details:
 1. Analysis
    a. What is a logical data mapping, and what does it mean to the ETL team?
    b. What are the primary goals of the data discovery phase of the data warehouse project?
    c. How is the system-of-record determined?
 2. Architecture
    a. What are the four basic data flow steps of an ETL process?
    b. What are the permissible data structures for the data staging area? Briefly describe the pros and cons of each.
    c. When should data be set to disk for safekeeping during the ETL process?
 3. Extraction
    a. Describe techniques for extracting from heterogeneous data sources.
    b. What is the best approach for handling ERP source data?
    c. Explain the pros and cons of communicating with databases natively versus through ODBC.
    d. Describe three change data capture (CDC) practices and the pros and cons of each.
 4. Data quality
    a. What are the four broad categories of data quality checks? Provide an implementation technique for each.
    b. At which stage of the ETL process should data be profiled?
    c. What are the essential deliverables of the data quality portion of ETL?
    d. How can data quality be quantified in the data warehouse?
 5. Building mappings
    a. What are surrogate keys? Explain how the surrogate key pipeline works.
    b. Why do dates require special treatment during the ETL process?
    c. Explain the three basic delivery steps for conformed dimensions.
    d. Name the three fundamental fact grains and describe an ETL approach for each.
    e. How are bridge tables delivered to classify groups of dimension records associated with a single fact?
    f. How does late-arriving data affect dimensions and facts? Share techniques for handling each.
 6. Metadata
    a. Describe the different types of ETL metadata and provide examples of each.
    b. Share acceptable mechanisms for capturing operational metadata.
    c. Offer techniques for sharing business and technical metadata.
 7. Optimization
    a. State the primary types of tables found in a data warehouse and the order in which they must be loaded to enforce referential integrity.
    b. What are the characteristics of the four levels of the ETL support model?
    c. What steps do you take to determine the bottleneck of a slow-running ETL process? Describe how to estimate the load time of a large ETL job.
 ...