Watson Discovery

How to get a list of files ingested into a Watson Discovery collection?

  • 1.  How to get a list of files ingested into a Watson Discovery collection?

    Posted Mon October 12, 2020 10:24 AM
    Hi all, 

    I am working on a Watson Discovery collection containing more than 10,000 documents. How can I get a list of all documents ingested into the collection using the API? I use Python to add documents to the collection. To get the file listing, I run a query (query='*.*'), but this retrieves at most 10,000 documents. When I use the offset and count parameters in the API call to page beyond that, the response is: "error" : "Result window is too large, count + offset must be less than or equal to 10000". Getting the full document list this way appears quite cumbersome. Does anyone have a solution for this problem?
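
    To make the limit concrete, here is a minimal sketch of the offset/count paging described above. It is not the actual Discovery client code: `fetch_page` is a hypothetical callable that, in real use, would wrap the query call with the offset and count parameters, and `page_windows` and `list_reachable_ids` are illustrative names. The only fact taken from the error message is that count + offset must not exceed 10000.

```python
from typing import Callable, Iterator, List, Tuple

# Limit quoted in the error message: count + offset must be <= 10000.
MAX_WINDOW = 10_000


def page_windows(total: int, page_size: int,
                 max_window: int = MAX_WINDOW) -> Iterator[Tuple[int, int]]:
    """Yield (offset, count) pairs that never exceed the result window."""
    reachable = min(total, max_window)
    offset = 0
    while offset < reachable:
        count = min(page_size, reachable - offset)
        yield offset, count
        offset += count


def list_reachable_ids(fetch_page: Callable[[int, int], List[str]],
                       total: int, page_size: int = 1000) -> List[str]:
    """Collect document IDs page by page, stopping at the window limit.

    fetch_page(offset, count) is a hypothetical stand-in for the query call.
    """
    ids: List[str] = []
    for offset, count in page_windows(total, page_size):
        ids.extend(fetch_page(offset, count))
    return ids


# Simulated collection of 12,500 documents (made-up IDs, no network access).
corpus = [f"doc-{i}" for i in range(12_500)]


def fake_fetch(offset: int, count: int) -> List[str]:
    """Stand-in for the real query: returns a slice of the simulated corpus."""
    return corpus[offset:offset + count]


reachable = list_reachable_ids(fake_fetch, total=len(corpus))
unreachable = len(corpus) - len(reachable)
```

    With the simulated collection, plain paging reaches only the first 10,000 documents and leaves the remaining 2,500 out, which is the gap the question is about.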

    Best regards, 
    Joost

    ------------------------------
    Joost Vos
    ------------------------------

    #WatsonDiscovery