Get PDF attachment content

1. Get PDF attachment content

Federico Camelino
Posted Wed September 20, 2023 10:10 AM

Hi,

I need to get the content of a PDF file as a string. I haven't found an app or Python code that covers this case. Can you help me?

Thanks, regards!

------------------------------
Federico Camelino
------------------------------
2. RE: Get PDF attachment content

Priya Sapra
Posted Thu September 21, 2023 03:18 PM

Hi, while we don't have an app to get the content of a PDF as a string, you may find the Image OCR Functions for IBM SOAR app helpful as it's able to interpret text from image files.

------------------------------
Priya Sapra
------------------------------

3. RE: Get PDF attachment content

Pol Estecha Hernández
Posted Fri September 22, 2023 06:56 AM

Greetings,

Sadly, there is no ready-made package for extracting the contents of a PDF.

However, you can obtain the contents of a PDF file (or any attachment, really) using the REST API functionality:

https://exchange.xforce.ibmcloud.com/hub/extension/b1e4814282b33a826f36c72cf1bc4751

First, query to get all Incident Attachments using:

/orgs/{org_id}/incidents/{inc_id}/attachments

Obtain the ID of the desired attachment, and then get the content using:

/orgs/{org_id}/incidents/{inc_id}/attachments/{attach_id}/contents
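
As a rough illustration of the two calls above made from a standalone Python script (rather than from the playbook function), here is a minimal sketch. The base URL, the `/rest` prefix, HTTP basic authentication with an API key ID and secret, and the `id` and `name` fields on each attachment are assumptions to check against your own SOAR instance. The helper names are made up for this example.

```python
import base64
import json
import urllib.request


def attachments_path(org_id, inc_id):
    """Path that lists all attachments of an incident."""
    return f"/orgs/{org_id}/incidents/{inc_id}/attachments"


def attachment_contents_path(org_id, inc_id, attach_id):
    """Path that returns the content of one attachment."""
    return f"{attachments_path(org_id, inc_id)}/{attach_id}/contents"


def find_attachment_id(attachments, name):
    """Return the ID of the first attachment with the given file name.

    `attachments` is assumed to be a list of dicts with 'id' and 'name' keys.
    """
    for item in attachments:
        if item.get("name") == name:
            return item.get("id")
    return None


def http_get(base_url, path, key_id, key_secret):
    """GET base_url + path and return the raw response bytes.

    Assumes HTTP basic authentication with an API key ID and secret.
    """
    token = base64.b64encode(f"{key_id}:{key_secret}".encode()).decode()
    request = urllib.request.Request(
        base_url + path, headers={"Authorization": f"Basic {token}"}
    )
    with urllib.request.urlopen(request) as response:
        return response.read()


# Hypothetical usage (needs network access and valid credentials):
# base = "https://soar.example.com/rest"   # assumed base URL
# listing = json.loads(http_get(base, attachments_path(201, 2095), key_id, key_secret))
# attach_id = find_attachment_id(listing, "report.pdf")  # assumes a JSON list
# pdf_bytes = http_get(base, attachment_contents_path(201, 2095, attach_id), key_id, key_secret)
```

Note that the content call returns the file's raw bytes, so for a PDF the result is binary data, not readable text.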

Either way, you should be careful with this approach. Malicious code can be injected into PDF files and then executed when their contents are read. I'm not entirely sure whether this can be triggered by GET actions specifically for PDFs, but you should still be cautious if you are going to read external or untrusted files.
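
One cautious habit is to treat the downloaded bytes as opaque data and take a quick look at them before handing them to any parser. The sketch below is a crude triage only, not a malware scanner: the token list is my own selection of PDF names often associated with active content, and a clean result proves nothing, because such names can be obfuscated or hidden inside compressed streams. The function name is hypothetical.

```python
# PDF name tokens commonly associated with active or embedded content.
RISKY_TOKENS = (b"/JavaScript", b"/JS", b"/OpenAction", b"/Launch", b"/EmbeddedFile")


def triage_pdf(data: bytes) -> dict:
    """Report whether the bytes start with a PDF header and which risky tokens appear.

    This is a plain byte search; nothing in the file is parsed or executed.
    """
    return {
        "has_pdf_header": data.startswith(b"%PDF-"),
        "risky_tokens": {
            token.decode(): data.count(token) for token in RISKY_TOKENS if token in data
        },
    }


sample = b"%PDF-1.7\n1 0 obj\n<< /OpenAction 2 0 R >>\nendobj\n"
print(triage_pdf(sample))
```

A file flagged this way could be routed to a sandbox or a dedicated analysis tool instead of being parsed directly.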

Cheers!

------------------------------
Pol Estecha Hernández
------------------------------

4. RE: Get PDF attachment content

Federico Camelino
Posted Fri September 29, 2023 03:09 PM
| view attached (2)

Hello Pol, I have been testing the function you suggested. I cannot obtain the content of the PDF file as plain text, but I can obtain it as JSON. I am attaching screenshots of the Playbook error and of the configuration of the "Call REST API" function.
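
For context on why plain text is hard to get here: a PDF is a binary format, so its bytes have to go through a PDF parser before any readable text comes out. Below is a minimal sketch using the third-party `pypdf` package, assuming it is installed wherever the code runs (for example on an integration server; I am not assuming it is available inside playbook scripts) and that the raw attachment bytes have already been downloaded. The function name is hypothetical.

```python
import io


def pdf_bytes_to_text(data: bytes) -> str:
    """Return the text layer of a PDF given its raw bytes."""
    if not data.startswith(b"%PDF-"):
        raise ValueError("input does not look like a PDF")
    # Imported here so the header check above works even without pypdf installed.
    from pypdf import PdfReader

    reader = PdfReader(io.BytesIO(data))
    # extract_text() may yield nothing for scanned, image-only pages.
    return "\n".join(page.extract_text() or "" for page in reader.pages)
```

If the PDF consists only of scanned images, this returns little or no text, and an OCR step such as the Image OCR Functions app mentioned earlier in the thread would be needed instead.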

Thanks and regards!

------------------------------
Federico Camelino
------------------------------

5. RE: Get PDF attachment content

Federico Camelino
Posted Wed November 15, 2023 10:40 AM

Hi Pol and team,

Is there any news on this topic?

I look forward to your reply. Regards!

------------------------------
Federico Camelino
------------------------------


IBM Security

Join our 16,000+ members as we work together to overcome the toughest challenges of cybersecurity.

IBM Security QRadar SOAR
