Hi Sam,
A 1GB flat file is indeed a large file, and special care should be taken when handling it, considering that the IS JVM cannot have more than ~2.5GB allocated to it.
Below is how I processed a file of this size (> 1GB, containing > 500,000 records):
1> To avoid any overhead, used only IS (no Broker, no TN, no Modeler, etc.).
2> Wrote a fileSplitter Java service. This service reads the input file as a stream, validates each record, and writes the records out to a set of temp files of 20,000 records each (the chunk size is configurable and supplied as an input to the service); its output is the list of temp files created (see the first sketch after this list).
3> The main flow then processes these temp files one at a time in a loop: it appends the results to the desired output file on each iteration and deletes each temp file after processing it (see the second sketch after this list).
4> Found that a split size of 20,000 records was optimal (total processing time < 2 hrs.); setting it higher or lower increased the total processing time.
5> The solution is scalable: if the input file grows in the future, the splitter will simply create more temp files, but IS will still handle only a small chunk of data (20,000 records) at a time, so it will not go out of memory.
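For illustration, here is a minimal sketch of what such a splitter could look like, written as plain Java rather than a full IS Java service (a real IS service would pull its inputs from the pipeline with IDataUtil). The names FileSplitter, splitFile, and chunkSize are mine, not from the actual service; it assumes newline-delimited records, and the per-record validation is left as a placeholder:

```java
import java.io.*;
import java.util.ArrayList;
import java.util.List;

public class FileSplitter {

    // Reads the large input file as a stream and writes it out in
    // chunks of chunkSize records, returning the list of temp files.
    // Only one record is held in memory at a time.
    public static List<File> splitFile(File input, int chunkSize) throws IOException {
        List<File> tempFiles = new ArrayList<>();
        try (BufferedReader reader = new BufferedReader(new FileReader(input))) {
            BufferedWriter writer = null;
            String record;
            int count = 0;
            while ((record = reader.readLine()) != null) {
                // per-record validation would go here
                if (count % chunkSize == 0) {
                    // start a new temp file every chunkSize records
                    if (writer != null) writer.close();
                    File current = File.createTempFile("chunk_", ".tmp");
                    tempFiles.add(current);
                    writer = new BufferedWriter(new FileWriter(current));
                }
                writer.write(record);
                writer.newLine();
                count++;
            }
            if (writer != null) writer.close();
        }
        return tempFiles;
    }
}
```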
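And a rough Java equivalent of the flow loop in step 3, again only a sketch: process one temp file at a time, append the result to the output file, and delete each chunk as soon as it is done. The transform method is a hypothetical stand-in for whatever mapping the real flow performs:

```java
import java.io.*;
import java.nio.file.*;
import java.util.List;

public class ChunkProcessor {

    // Processes the temp files one at a time, appending the results to
    // the output file, so only ~chunkSize records are in flight at once.
    public static void processAll(List<File> tempFiles, File output) throws IOException {
        for (File chunk : tempFiles) {
            try (BufferedReader in = new BufferedReader(new FileReader(chunk));
                 BufferedWriter out = Files.newBufferedWriter(output.toPath(),
                         StandardOpenOption.CREATE, StandardOpenOption.APPEND)) {
                String record;
                while ((record = in.readLine()) != null) {
                    out.write(transform(record));
                    out.newLine();
                }
            }
            chunk.delete();  // free disk space as soon as the chunk is processed
        }
    }

    // Hypothetical per-record transformation; in the real integration
    // this mapping was done in the flow service.
    private static String transform(String record) {
        return record;
    }
}
```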
Your integration may be quite different from mine, but I wanted to give you some pointers to ponder while handling a large file (> 1GB).
HTH,
Bhawesh.