NVDA does not read a pdf document properly #2756

nvaccessAuto · 2012-10-29T10:24:07Z

Reported by aliminator on 2012-10-29 10:24
In this scenario, NVDA does not read the document properly. It seems to be that especially for every word the first two letters are doubled and repeated.
Please have a look at this document located at
[http://www.studentenwerk-giessen.de/docs/HSG/Speisepl%E4ne/THM-GI.pdf]

You don't need to understand the contents.
But please compare the output of NVDA 2012.2.1 and 2012.3b3.
The former version works properly with the document and shows the words appropiately.
Adobe Reader X 10.1.4 was used.

nvaccessAuto · 2012-10-29T10:39:16Z

Comment 1 by aliminator on 2012-10-29 10:39
I forgot to mention that the issue occurred when tables should be displayed.
It could not be reproduced using another document with a table either.

nvaccessAuto · 2012-10-29T11:37:36Z

Comment 2 by briang1 on 2012-10-29 11:37

Yes, a little way down the line starts with the word pizza, and the second word according to the beta is..
DiDio
However, on the current release we get what it should be...
Diavolo

I have never seen this on english pdfs,with or without tables, though there is stil an anomoly where spaces between words will be removed if a graphic is in front of the text, or appears to be according to nvda.

nvaccessAuto · 2012-10-30T06:55:11Z

Comment 3 by jteh on 2012-10-30 06:55
We did change the way text is retrieved to work around a bug in Adobe Reader which prevented us from retrieving any formatting. However, I don't see doubled characters with Adobe Reader XI (which is the latest version). Perhaps there's a fix in Reader XI which affects the way we're doing this now. Please test with Reader XI and report.

nvaccessAuto · 2012-10-30T08:33:56Z

Comment 4 by jteh on 2012-10-30 08:33
00d0854 may help, though I still think this is probably a Reader bug.

nvaccessAuto · 2012-10-30T09:18:45Z

Comment 5 by briang1 on 2012-10-30 09:18
Well that file does work as far as reading is concerned in Adobe 11 on XP but control cursor navigate by word seems not to, it gets only part of each word, but once again its OK in a normal type pdf of course.
Hope that helps.

nvaccessAuto · 2012-10-30T15:40:55Z

Comment 6 by briang1 on 2012-10-30 15:40
Yes the fix just committed seems to have fixed it in reader X
The strange reading of words seems to be still there, but may well be something to do with the format of the table used. Not sure, but if a comma on the nend of the word it works.
Incidentaly, if anyone has a way to report bugs to Adobe maybe the one where the alert about removing protected mode at the start could be fixed by them. At present it says that the checbox fo this to help XP users see the content is in the general section of prefs, it is in fact now moved to enhanced security in adobe reader 11, but the text was obviously not changed before release.

nvaccessAuto · 2012-10-30T16:24:41Z

Comment 7 by aliminator on 2012-10-30 16:24
Hmm I tested this issue in Adobe Reader XI; no defect such as in X. It seems to be fixed, although in the table (especially in the first row) the last letter of each word is being read separately as if it is not one word. In braille, it is displayed correctly. But I think this is not a new one. Is there any ticket already opened for this issue?

nvaccessAuto · 2012-10-30T17:06:02Z

Comment 8 by briang1 on 2012-10-30 17:06
Just to get this so we are all talking about the same thing. My comment above was me testing the latest snap with the fix in it with reader X, and the comments about the control/cursor reading applies to both X and 11. I assume this is what you mean above. I noted that it read the words with the commas, but often missed the last char off of other words. Its a strange file this one, it would be interesting to know what produced it as I had some tabular info in a pdf in English and this was not happening there.

nvaccessAuto · 2012-10-30T17:20:13Z

Comment 9 by jteh (in reply to comment 7) on 2012-10-30 17:20
Replying to aliminator:

It seems to be fixed, although in the table (especially in the first row) the last letter of each word is being read separately as if it is not one word.

I think this is a problem with the PDF. Those letters are split into separate nodes for some reason and formatting info can't be retrieved for them either. This is a fairly inaccessible PDF; no tagging for a start.

Marking as fixed as per comment:8.
Changes:
Milestone changed from None to 2012.3
State: closed

nvaccessAuto added bug feature/browse-mode bug/regression labels Nov 10, 2015

nvaccessAuto assigned jcsteh Nov 10, 2015

nvaccessAuto added this to the 2012.3 milestone Nov 10, 2015

nvaccessAuto closed this as completed Nov 10, 2015

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

NVDA does not read a pdf document properly #2756

NVDA does not read a pdf document properly #2756

nvaccessAuto commented Oct 29, 2012

nvaccessAuto commented Oct 29, 2012

nvaccessAuto commented Oct 29, 2012

nvaccessAuto commented Oct 30, 2012

nvaccessAuto commented Oct 30, 2012

nvaccessAuto commented Oct 30, 2012

nvaccessAuto commented Oct 30, 2012

nvaccessAuto commented Oct 30, 2012

nvaccessAuto commented Oct 30, 2012

nvaccessAuto commented Oct 30, 2012

NVDA does not read a pdf document properly #2756

NVDA does not read a pdf document properly #2756

Comments

nvaccessAuto commented Oct 29, 2012

nvaccessAuto commented Oct 29, 2012

nvaccessAuto commented Oct 29, 2012

nvaccessAuto commented Oct 30, 2012

nvaccessAuto commented Oct 30, 2012

nvaccessAuto commented Oct 30, 2012

nvaccessAuto commented Oct 30, 2012

nvaccessAuto commented Oct 30, 2012

nvaccessAuto commented Oct 30, 2012

nvaccessAuto commented Oct 30, 2012