Interview Questions Answers.ORG
Interviewer And Interviewee Guide
Interviews
Quizzes
Home
Quizzes
Interviews Data Warehouse Job Interviews:Ab InitioActuateAnalysis ServicesBO DesignerBusiness ObjectsCognos Data WarehouseColdFusionCrystal Enterprise SuiteData IntegrationData ScientistData StageData Warehouse AnalystData Warehouse BIData Warehouse Data MartData Warehouse DeveloperData Warehouse ManagerData Warehouse SupervisorData Warehouses MicroStrategyData WarehousingData Warehousing BasicsDatabase PartitioningEnterprise Data WarehouseETL (Extract, transform, load)General DatawarehousingHyperionImpromptuInformatica Data WarehousingLead Data ScientistMicro StrategyOperational Data StoreReportnetSAS (Statistical Analysis System)Warehouse AssistantWarehouse ConceptsWarehouse DataStageWarehouse Operation ManagerWarehouse Worker
Copyright © 2018. All Rights Reserved
Ab Initio Interview Question:
What is BRODCASTING and REPLICATE?
Submitted by: AdministratorBroadcast - Takes data from multiple inputs, combines it and sends it to all the output ports.
Eg - You have 2 incoming flows (This can be data parallelism or component parallelism) on Broadcast component, one with 10 records & other with 20 records. Then on all the outgoing flows (it can be any number of flows) will have 10 + 20 = 30 records
Replicate - It replicates the data for a particular partition and send it out to multiple out ports of the component, but maintains the partition integrity.
Eg - Your incoming flow to replicate has a data parallelism level of 2. with one partition having 10 recs & other one having 20 recs. Now suppose you have 3 output flos from replicate. Then each flow will have 2 data partitions with 10 & 20 records respectively.
Submitted by: Administrator
Eg - You have 2 incoming flows (This can be data parallelism or component parallelism) on Broadcast component, one with 10 records & other with 20 records. Then on all the outgoing flows (it can be any number of flows) will have 10 + 20 = 30 records
Replicate - It replicates the data for a particular partition and send it out to multiple out ports of the component, but maintains the partition integrity.
Eg - Your incoming flow to replicate has a data parallelism level of 2. with one partition having 10 recs & other one having 20 recs. Now suppose you have 3 output flos from replicate. Then each flow will have 2 data partitions with 10 & 20 records respectively.
Submitted by: Administrator
Broadcast is used to increased the data parallelism by connecting fan-out flow to the output.Replicate is used to increase the component parallelism by connecting multiple straight flow to the output.
Submitted by: Manjunatha
Submitted by: Manjunatha
Copyright 2007-2024 by Interview Questions Answers .ORG All Rights Reserved.
https://InterviewQuestionsAnswers.ORG.
https://InterviewQuestionsAnswers.ORG.