I am using tFileInputFullRow to split a large CSV file (~2 GB) into many smaller files with the option 'Split output in several files'.
When I set 'Use specific JVM arguments' to -Xmx4096M and -Xms2048M, I get the error: Exception in thread "main" java.lang.OutOfMemoryError: Java heap space
When I increase the values to -Xmx8192M and -Xms2048M, I get the error: java.lang.NegativeArraySizeException
I have tried changing the sizes everywhere, but I still cannot make it work.
Can anyone please help? Thanks.
Are you able to load your large CSV file with the tFileInputDelimited component?
The 'Split output in several files' option in the Advanced settings of the tFileOutputDelimited component will split the input file based on row count.
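For anyone who wants to see what splitting by row count looks like outside of Talend, here is a minimal sketch in plain Java. It is not the code Talend generates; the class name CsvSplitter, the method split and the part_N.csv file naming are made up for illustration. It reads one line at a time, so memory use stays small regardless of the input size. It assumes one record per line (no quoted fields containing line breaks) and does not repeat a header row in each part.

```java
import java.io.BufferedReader;
import java.io.BufferedWriter;
import java.io.IOException;
import java.nio.charset.Charset;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.ArrayList;
import java.util.List;

public class CsvSplitter {

    // Hypothetical helper (not a Talend API): streams the input line by line
    // and starts a new output file every rowsPerFile rows.
    public static List<Path> split(Path input, Path outDir, int rowsPerFile, Charset charset)
            throws IOException {
        if (rowsPerFile <= 0) {
            throw new IllegalArgumentException("rowsPerFile must be positive");
        }
        Files.createDirectories(outDir);
        List<Path> parts = new ArrayList<>();
        BufferedWriter writer = null;
        int rowsInPart = 0;
        try (BufferedReader reader = Files.newBufferedReader(input, charset)) {
            String line;
            // readLine() accepts "\n", "\r" and "\r\n" as line terminators.
            while ((line = reader.readLine()) != null) {
                if (writer == null || rowsInPart == rowsPerFile) {
                    if (writer != null) {
                        writer.close();
                    }
                    // Illustrative naming scheme: part_0.csv, part_1.csv, ...
                    Path part = outDir.resolve("part_" + parts.size() + ".csv");
                    parts.add(part);
                    writer = Files.newBufferedWriter(part, charset);
                    rowsInPart = 0;
                }
                writer.write(line);
                writer.newLine();
                rowsInPart++;
            }
        } finally {
            if (writer != null) {
                writer.close();
            }
        }
        return parts;
    }
}
```

Calling CsvSplitter.split(input, outDir, 100000, charset) would write files of at most 100000 rows each into outDir and return their paths in order.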
I did two things to fix my problem:
- I found that I was using the 32-bit JRE, so I installed the 64-bit version instead.
- In the CSV settings, I selected '\n' instead of '\r\n' as the row separator and changed the encoding to ISO-8859-15.
After that, the job ran smoothly.
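If you want to confirm which JVM a job actually runs on, a small check like the one below can help. This is only a sketch: the class name JvmCheck is made up, and the sun.arch.data.model property is specific to HotSpot-style JVMs, so it may be missing on other runtimes (the code then prints "unknown"). A 32-bit JVM generally cannot provide a heap as large as 4 GB, so large -Xmx values are only useful on a 64-bit JVM.

```java
public class JvmCheck {

    // Maximum heap the running JVM will try to use, in MiB.
    // This reflects the effective -Xmx setting (or the JVM default).
    public static long maxHeapMiB() {
        return Runtime.getRuntime().maxMemory() / (1024L * 1024L);
    }

    // Summary of the runtime: version, architecture and data model.
    // "sun.arch.data.model" is vendor-specific and may not be set.
    public static String describe() {
        String dataModel = System.getProperty("sun.arch.data.model", "unknown");
        return "java.version=" + System.getProperty("java.version")
                + ", os.arch=" + System.getProperty("os.arch")
                + ", data.model=" + dataModel
                + ", maxHeapMiB=" + maxHeapMiB();
    }

    public static void main(String[] args) {
        System.out.println(describe());
    }
}
```

Running it with the same JVM arguments as the job (for example -Xmx4096M) shows whether the JVM is 32-bit or 64-bit and how much heap it really got.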