Data profiling from file and show the results in browser

Five Stars rm

Gurus,
We are using the Big Data Enterprise edition. We have a specific requirement: a user should be able to search for a file and click it in the browser, and then see the following information about the file in the browser in tabular form.
-->Top & tail 'n' rows of file with header
-->Total rows in file
-->Total columns
-->File metadata like column names, datatype, Length, Max value, Min value
Is there any way to accomplish this task in Talend? Kindly suggest.
Thanks
Moderator

Re: Data profiling from file and show the results in browser

Hi,
Do you want to create a web service and post your data to it, including the following?
-->Top & tail 'n' rows of file with header
-->Total rows in file
-->Total columns
-->File metadata like column names, datatype, Length, Max value, Min value?
Could you please elaborate your case with an example with input and expected output values?
Best regards
Sabrina
Five Stars rm

Re: Data profiling from file and show the results in browser

Sorry for the delay in response.
Do you want to create a webservice and post your data on it? Which includes
-->Top & tail 'n' rows of file with header
-->Total rows in file
-->Total columns

Yes

-->File metadata like column count, column names, file-size, datatype, Length, Max value, Min value?

Yes.

Could you please elaborate your case with an example with input and expected output values?

Say, for instance, I have a file named X.txt with 4 columns, namely COLA, COLB, COLC and COLD, with or without a header.
1|3|5|7
2|4|6|8

The user can select the file X.txt from a drop-down. We expect the following features:
-->They can preview the data, using either head or tail
COLA|COLB|COLC|COLD
1|3|5|7
2|4|6|8
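
To make the preview requirement concrete, here is a minimal sketch in plain Python (not a Talend component or Talend API). The function name `preview`, its parameters, and the pipe delimiter are assumptions for illustration only. It returns the header plus the first and last n rows without loading the whole file into memory.

```python
from collections import deque
from itertools import islice


def preview(lines, n=2, header=None, delimiter="|"):
    """Return (header, head_rows, tail_rows) for an iterable of delimited lines.

    If `header` is None, the first non-empty line is assumed to be the header.
    For a headerless file, pass the column names explicitly.
    """
    # Lazily split each non-empty line into fields.
    rows = (line.rstrip("\n").split(delimiter) for line in lines if line.strip())
    if header is None:
        header = next(rows, [])
    # First n data rows.
    head = list(islice(rows, n))
    # A bounded deque keeps only the last n rows seen; seeding it with the
    # head makes short files return overlapping head and tail.
    tail = deque(head, maxlen=n)
    tail.extend(rows)
    return header, head, list(tail)


if __name__ == "__main__":
    sample = ["COLA|COLB|COLC|COLD", "1|3|5|7", "2|4|6|8"]
    print(preview(sample, n=1))
```

The same function accepts an open file handle, since a file object is also an iterable of lines.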

-->Total columns, Total rows, File size

Total Columns|Total rows|File size
4|2|3KB
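
The summary row above could be computed roughly as follows. This is a sketch in plain Python rather than anything Talend-specific; `file_summary`, `has_header` and the returned key names are hypothetical, and the size is reported in bytes rather than KB.

```python
import os


def file_summary(path, has_header=True, delimiter="|"):
    """Return total columns, total data rows and file size for a delimited file."""
    total_rows = 0
    total_columns = 0
    with open(path, encoding="utf-8") as fh:
        # Skip blank lines so a trailing newline does not count as a row.
        lines = (line.rstrip("\n") for line in fh if line.strip())
        first = next(lines, None)
        if first is not None:
            # Column count is taken from the first non-empty line.
            total_columns = len(first.split(delimiter))
            # The first line counts as data only when there is no header.
            total_rows = sum(1 for _ in lines) + (0 if has_header else 1)
    return {
        "total_columns": total_columns,
        "total_rows": total_rows,
        "file_size_bytes": os.path.getsize(path),
    }
```

Counting columns from the first line only is a simplification; a file with ragged rows would need a per-row check.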

-->Run time stats of previous run of job.
-->Minimum value, Maximum value
Field name|Column position|Total count|Null count|Percentage of Null|Minimum value|Maximum value
Amt|3|2|0|0|5|6
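
For the per-column statistics, a minimal plain-Python sketch is shown here; it is not Talend code. The name `column_stats` and the dictionary keys are assumptions, an empty or whitespace-only field is treated as null, and values are compared numerically only when every non-null value in the column parses as a number.

```python
def column_stats(rows, header):
    """Compute position, counts, null percentage, min and max for each column.

    `rows` is a list of already-split records (lists of strings).
    """
    stats = []
    for pos, name in enumerate(header, start=1):
        # Treat a missing trailing field as an empty (null) value.
        values = [row[pos - 1] if pos <= len(row) else "" for row in rows]
        non_null = [v for v in values if v.strip() != ""]
        try:
            # Numeric comparison when every value parses as a float.
            keyed = [(float(v), v) for v in non_null]
        except ValueError:
            # Otherwise fall back to plain string comparison.
            keyed = [(v, v) for v in non_null]
        total = len(values)
        nulls = total - len(non_null)
        stats.append({
            "field": name,
            "position": pos,
            "total": total,
            "nulls": nulls,
            "pct_null": 100.0 * nulls / total if total else 0.0,
            "min": min(keyed)[1] if keyed else None,
            "max": max(keyed)[1] if keyed else None,
        })
    return stats


if __name__ == "__main__":
    data = [["1", "3", "5", "7"], ["2", "4", "6", "8"]]
    for entry in column_stats(data, ["COLA", "COLB", "COLC", "COLD"]):
        print(entry)
```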

-->Frequency distribution:
Column|Column Values|Count|Percentage
COLA|1|1|50
COLA|2|1|50
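
The frequency distribution can be sketched in plain Python with `collections.Counter`. Again, this is only an illustration outside Talend; `frequency_distribution` and the tuple layout (column, value, count, percentage) are assumptions that mirror the table above.

```python
from collections import Counter


def frequency_distribution(rows, header):
    """Return (column, value, count, percentage) tuples for every column.

    `rows` is a list of already-split records (lists of strings).
    """
    result = []
    for idx, name in enumerate(header):
        # Count each distinct value in this column, skipping short rows.
        counts = Counter(row[idx] for row in rows if idx < len(row))
        total = sum(counts.values())
        for value, count in counts.items():
            result.append((name, value, count, 100.0 * count / total))
    return result


if __name__ == "__main__":
    data = [["1", "3", "5", "7"], ["2", "4", "6", "8"]]
    for entry in frequency_distribution(data, ["COLA", "COLB", "COLC", "COLD"]):
        print("|".join(str(part) for part in entry))
```

Values appear in the order they are first seen in the file, because `Counter` preserves insertion order on Python 3.7 and later.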

 
Sabrina, can this be accomplished?
Thanks