splitting PDF text to array

Question

Hi, I'm testing a PDF file using the PDFBox Java classes. Everything is set up properly and I can strip the text from the PDF. The problem is converting that text to an array. My function is below:

function ABCD(){
  var docObj = loadDocument("E:\\Temp\\Report100.pdf");
   //Create a text stripper object to get text 
  var textStripperObj = JavaClasses.org_apache_pdfbox_util.PDFTextStripper.newInstance();
  var text = textStripperObj.getText_2(docObj);  
  Log.Message('',text);
  var textArray = text.split('\r');
  Log.Message(textArray.Length);
  for (var i = 0; i < 25; i++){
    Log.Message( String(textArray[i])+ String(i));
  } 
}

From the log message I can see the correct text, but not in textArray. I also tried split('\n') and split('\b'). In the debugger I can see that the text is being broken into an array, but I cannot get the array values.

It is not possible to compare the old PDF directly with the new PDF because the page structure is different, although the contents are the same (except for dates). For example, the old PDF has 6 pages but the new PDF has 5 pages.

hkosova · Accepted Answer

Hi NisHera,
This looks very similar to the array issue discussed in this thread. Try replacing
var textArray = text.split('\r');
with
var textArray = text.split('\r').OleValue.toArray();
and see if it helps.
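
If the conversion above is not available, another option that may work is to coerce the stripped text to a native script string first and split that, so the result is an ordinary script array rather than a Java array. The following is only a minimal sketch under that assumption: splitLines is a hypothetical helper (not part of TestComplete or PDFBox), the sample string stands in for the text returned by getText_2, and it has not been verified against a Java string inside TestComplete.

```javascript
// Hypothetical helper: not part of TestComplete or PDFBox.
function splitLines(text) {
  // Assumption: String(...) turns the value returned by getText_2
  // into a native script string.
  var nativeText = String(text);

  // Split on Windows (\r\n), bare \r, or Unix (\n) line endings.
  var parts = nativeText.split(/\r\n|\r|\n/);

  // Drop empty entries, e.g. the one produced by a trailing line break.
  var lines = [];
  for (var i = 0; i < parts.length; i++) {
    if (parts[i] !== "") {
      lines.push(parts[i]);
    }
  }
  return lines;
}

// Stand-in for the text stripped from the PDF.
var sample = "Report 100\r\nDate: 01/01/2016\r\nTotal: 42\r\n";
var lines = splitLines(sample);
```

In a TestComplete script, lines[i] could then be passed to Log.Message in the same loop as in the question, using lines.length as the upper bound instead of a fixed 25.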
