webMethods

Join this online group to communicate across IBM product users and experts by sharing advice and best practices with peers and staying up to date regarding product enhancements.

View Only

Back to discussions

Expand all | Collapse all

How to remove duplicate records in flat file?

1. How to remove duplicate records in flat file?

Like
webMethods Community Member
Posted Tue March 19, 2024 04:25 AM

Reply
Hi,

I have a flat file with a single field consisting of personnel numbers. I would like to remove duplicate records in this flat file.

Inside designer, I converted the flat file into IS document type and sorted the records based on ascending order using pub.document:sortDocuments. Then, I did a LOOP step on the IS document type with a BRANCH step inside. The condition in the branch step is when current iteration’s record is not equal to the next interation’s (trying to make this dynamic by adding a variable called iteration2) record, I will append it to another document list. After each iteration, I will add 1 to the variable “iteration2”.

An example of the condition in the branch step: %DocumentDT/recordWithNoId/Employee Number% != %DocumentDT/recordWithNoId[%interation2%]/Employee Number%

However, this is not working. I would like to check if this the best way to remove duplicate records in designer? or do you have a better suggestion on how to remove duplicate record in a flat file?

Thank you.

#webMethods
#Service-Designer
#Managed-File-Transfer
#Integration-Server-and-ESB
#B2B-Integration
2. RE: How to remove duplicate records in flat file?

Like
webMethods Community Member
Posted Tue March 19, 2024 11:35 AM

Reply
Is it really just a file with a single column of personnel number? Nothing else? If so, don’t need to FF parse this, can use simplified mechanism to convert to a string list and use Java classes to help remove duplicates. Can you share more details?

#Managed-File-Transfer
#Integration-Server-and-ESB
#webMethods
#B2B-Integration
#Service-Designer
3. RE: How to remove duplicate records in flat file?

Like
webMethods Community Member
Posted Wed March 20, 2024 12:14 AM

Reply
Hi Hwee Yee,

For situations where we’re dealing with a single column, utilizing the hashtable service could be advantageous. By creating a hashtable object and populating it with the records, uniqueness is automatically ensured. This allows for easy retrieval of the unique records as a list.

#B2B-Integration
#webMethods
#Integration-Server-and-ESB
#Managed-File-Transfer
#Service-Designer
4. RE: How to remove duplicate records in flat file?

Like
webMethods Community Member
Posted Wed March 20, 2024 09:55 AM

Reply
Even with a multi-column file, a Hash* class can be helpful to eliminate duplicates. But a key aspect of the question is how big is the data expected to be. If “too big” then other techniques will be needed.

#Integration-Server-and-ESB
#webMethods
#B2B-Integration
#Managed-File-Transfer
#Service-Designer

webMethods

webMethods

How to remove duplicate records in flat file?

webMethods Community MemberTue March 19, 2024 04:25 AM

webMethods Community MemberTue March 19, 2024 11:35 AM

webMethods Community MemberWed March 20, 2024 12:14 AM

webMethods Community MemberWed March 20, 2024 09:55 AM

1. How to remove duplicate records in flat file?

2. RE: How to remove duplicate records in flat file?

3. RE: How to remove duplicate records in flat file?

4. RE: How to remove duplicate records in flat file?

Additional
Resources

Office

Quick Links

webMethods

webMethods

How to remove duplicate records in flat file?

webMethods Community MemberTue March 19, 2024 04:25 AM

webMethods Community MemberTue March 19, 2024 11:35 AM

webMethods Community MemberWed March 20, 2024 12:14 AM

webMethods Community MemberWed March 20, 2024 09:55 AM

1. How to remove duplicate records in flat file?

2. RE: How to remove duplicate records in flat file?

3. RE: How to remove duplicate records in flat file?

4. RE: How to remove duplicate records in flat file?

Additional Resources

Office

Quick Links

Additional
Resources