When to use tAggregateRow and when to use tSortRow + tAggregateSortedRow

Five Stars


I am new to Talend. Can anybody tell me when to use tAggregateRow and when to use tSortRow -> tAggregateSortedRow, given that both produce the same output?

 

 

Employee

Re: When to use tAggregateRow and when to use tSortRow + tAggregateSortedRow

Hi,

 

Please refer to the post below, where it is explained with an example.

 

https://community.talend.com/t5/Design-and-Development/resolved-taggregaterow-vs-taggregatesortedrow...

 

Warm Regards,
Nikhil Thampi

Please appreciate our Talend community members by giving Kudos for sharing their time for your query. If your query is answered, please mark the topic as resolved :-)

Employee

Re: When to use tAggregateRow and when to use tSortRow + tAggregateSortedRow

Hi,

 

tAggregateSortedRow aggregates already-sorted input data into the output columns based on a set of operations.

 

tAggregateRow receives a flow and aggregates it based on one or more columns.

 

https://help.talend.com/reader/KxVIhxtXBBFymmkkWJ~O4Q/VVQYE5AV~OFaAnSfC13t2g

https://help.talend.com/reader/KxVIhxtXBBFymmkkWJ~O4Q/i_YvOl2oUaVpW1UTnJao_g

 

So the first one, tAggregateSortedRow, avoids the sorting part because it assumes the incoming data is already sorted. For data that is already sorted, tAggregateSortedRow will therefore give better performance.
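To illustrate why sorted input helps, here is a minimal Java sketch of aggregating rows that arrive already sorted by the group key. It is not Talend's generated code; the class name `SortedAggregationSketch`, the method `sumSorted`, and the use of `Map.Entry<String, Long>` as a row (key plus amount) are all assumptions made for the example. The point is that only one running total is held at a time, and a group is emitted as soon as the key changes.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;

// Hypothetical sketch, not Talend code: sum a value per key over sorted input.
class SortedAggregationSketch {

    // Assumes rows are sorted by key. Each row is (key, amount).
    static List<Map.Entry<String, Long>> sumSorted(Iterable<Map.Entry<String, Long>> sortedRows) {
        List<Map.Entry<String, Long>> out = new ArrayList<>();
        String currentKey = null;
        long runningTotal = 0;
        boolean seenAny = false;

        for (Map.Entry<String, Long> row : sortedRows) {
            // A key change means the previous group is complete, so emit it.
            if (seenAny && !row.getKey().equals(currentKey)) {
                out.add(Map.entry(currentKey, runningTotal));
                runningTotal = 0;
            }
            currentKey = row.getKey();
            runningTotal += row.getValue();
            seenAny = true;
        }
        // Emit the last group, if there was any input at all.
        if (seenAny) {
            out.add(Map.entry(currentKey, runningTotal));
        }
        return out;
    }

    public static void main(String[] args) {
        List<Map.Entry<String, Long>> rows = List.of(
                Map.entry("A", 10L), Map.entry("A", 5L), Map.entry("B", 7L));
        // Totals: A is 15, B is 7.
        System.out.println(sumSorted(rows));
    }
}
```

If the input were not actually sorted, this loop would emit the same key more than once, which is why the sorted-input assumption matters.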

 

Warm Regards,
Nikhil Thampi


 

Five Stars

Re: When to use tAggregateRow and when to use tSortRow + tAggregateSortedRow

Thanks for the reply, Nikhil. But if the source data is not sorted, which of the two options below will give better performance?

1:- tAggregateRow

2:- tSortRow -> tAggregateSortedRow

 

Or will both of them give the same performance?

Employee

Re: When to use tAggregateRow and when to use tSortRow + tAggregateSortedRow

Hi Shameer,

 

    Performance will be more or less the same, since the same actions have to be performed either way (within one component or across two components).

 

Warm Regards,
Nikhil Thampi


 

 

Nine Stars

Re: When to use tAggregateRow and when to use tSortRow + tAggregateSortedRow

Hi,

 

The two options you suggest have one major functional difference: if the incoming data set you're trying to aggregate is large, then using tSortRow and tAggregateSortedRow is the only way to go. tAggregateRow has to keep the data set in memory, so it will run into problems as the size of the data set increases. Use the sort-on-disk functionality in tSortRow and you'll be fine.
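As a rough illustration of the memory point, here is a minimal Java sketch of aggregating unsorted input. It is not Talend's generated code; the class name `UnsortedAggregationSketch`, the method `sumUnsorted`, and the `Map.Entry<String, Long>` row shape are assumptions for the example. Because any key can reappear at any time, nothing can be emitted until the whole input has been read, so the aggregation state has to stay in memory for the full run.

```java
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical sketch, not Talend code: sum a value per key over unsorted input.
class UnsortedAggregationSketch {

    // Each row is (key, amount). Rows may arrive in any order.
    static Map<String, Long> sumUnsorted(Iterable<Map.Entry<String, Long>> rows) {
        // One entry per distinct key is kept until the input is exhausted.
        Map<String, Long> totals = new LinkedHashMap<>();
        for (Map.Entry<String, Long> row : rows) {
            // Add the amount to the existing total, or start a new total.
            totals.merge(row.getKey(), row.getValue(), Long::sum);
        }
        return totals;
    }

    public static void main(String[] args) {
        List<Map.Entry<String, Long>> rows = List.of(
                Map.entry("B", 7L), Map.entry("A", 10L), Map.entry("A", 5L));
        // Totals: B is 7, A is 15.
        System.out.println(sumUnsorted(rows));
    }
}
```

With sorted input, a group can be emitted and forgotten as soon as its key changes, and the sort itself can spill to disk, which is the trade-off described above.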

 

Regards David
Don't forget to give Kudos when an answer is helpful, or mark the answer as the solution.
