LabVIEW

cancel
Showing results for 
Search instead for 
Did you mean: 

how to get name from resume in LabVIEW. How to read from PDF.

@wiebe: have you tried the PA-AI Builder text recognition?

CY (expired CLAD)
0 Kudos
Message 11 of 25
(941 Views)

@cy... wrote:

@wiebe: have you tried the PA-AI Builder text recognition?


No, but I have made a PDF toolkit 😋.

 

CRC would indeed be a way out, to get most text out of most PDFs. I don't think it's a strategy used my most text extraction tools.

0 Kudos
Message 12 of 25
(932 Views)

Want to read text from pdf.

0 Kudos
Message 13 of 25
(911 Views)

Wanted to read text from pdf.

0 Kudos
Message 14 of 25
(911 Views)

No, please guide me how to do this.

0 Kudos
Message 15 of 25
(910 Views)

please tell me how to use pdf tool kit  and where i can find it.

0 Kudos
Message 16 of 25
(906 Views)

The LabVIEW PDF Toolkits out there wouldn’t help you, not even very much  if you plan to disassemble them to learn how you have to parse a PDF file yourself in order to get that information. 

.They all do the opposite of what you want, creating a PDF document as the programmer calls specific functions to add text blocks, images, page and document formatting commands and more to it. This is already quite a lot of work but relatively manageable as you need to concern yourself only with the things you want to add. 

 

Reading a PDF document consistently is a lot mote complicated. PDF is a very rich document description language based on EPS which was designed to allow transfering any imaginable document to a printer and make the result look almost perfectly WYSIWYG. It means also that the resulting PDF document can be very different depending on the tool which created it but look in a perfectly implemented renderer exactly the same. Your PDF parser has to be prepared to face lots and lots of different possible syntax elements and it’s not always possible to skip unknown syntax elements as that can mess up your parser state.

Rolf Kalbermatter
My Blog
0 Kudos
Message 17 of 25
(897 Views)

wiebe@CARYA wrote

CRC would indeed be a way out, to get most text out of most PDFs. I don't think it's a strategy used my most text extraction tools.


Considering that PDF also allows compression of parts in its document and also encryption, that “most” could be still not enough.

Rolf Kalbermatter
My Blog
0 Kudos
Message 18 of 25
(890 Views)

@rolfk wrote:

wiebe@CARYA wrote

CRC would indeed be a way out, to get most text out of most PDFs. I don't think it's a strategy used my most text extraction tools.


Considering that PDF also allows compression of parts in its document and also encryption, that “most” could be still not enough.


Well, I do mean specialized "text from PDF" extractors. These would parse the PDFs, including decompression and even decrypting. Extracting or reading text from a PDF with notepad, or a tool that reads text from a file has little change.

 

The standard compressions is a simple inflate (IIRC there are ways to specify a custom compression algorithm). This is so common in PDFs, without support for this you'd be lucky if you can parse 2% of the PDFs.

 

The encryption only is more or less the same thing. If you only protect printing and editing, there isn't really any encryption. A tool could reverse this. Not sure how easy that is, I only did the decrypting (a while ago).

0 Kudos
Message 19 of 25
(869 Views)

message to OP: if you do not mind having a running cost in your resume processing, I would recommend another platform, read earlier posts for reference. otherwise, have the applicant fill in a standardized forms could be more feasible.

CY (expired CLAD)
0 Kudos
Message 20 of 25
(851 Views)