On Thu, Jun 09, 2016 at 07:56:58PM -0400, Fred Smith wrote:
On Thu, Jun 09, 2016 at 05:23:02PM -0400, Robert Moskowitz wrote:
>
>
> On 06/09/2016 03:25 PM, Fred Smith wrote:
> >On Thu, Jun 09, 2016 at 02:11:26PM -0400, Robert Moskowitz wrote:
> >>
> >>On 06/09/2016 02:01 PM, Samuel Sieb wrote:
> >>>On 06/09/2016 10:55 AM, Robert Moskowitz wrote:
> >>>>On 06/09/2016 01:38 PM, Samuel Sieb wrote:
> >>>>>On 06/09/2016 09:34 AM, Robert Moskowitz wrote:
> >>>>>>What other tool can read PDFs and let me select an image
> >>>>>>(e.g. a figure in an IEEE standard) that I can then copy
> >>>>>>over to LibreOffice?
> >>>>>>
> >>>>>Why don't you just use LibreOffice to open the pdf?
> >>>>When I try opening the IEEE 802.1AE-2006 pdf, it hangs. And it is
> >>>>only a 142pg document.
> >>>>
> >>>Ok, I've never tried opening one that big. And since that file is
> >>>not publicly available, I can't test it.
> >>http://standards.ieee.org/getieee802/download/802.1AE-2006.pdf
> >>
> >>All 802 standards are available free 6 months after publication. We
> >>802 attendees pay the IEEE for this in our conference attendance
> >>fee.
> >>
> >>see:
> >>
> >>http://standards.ieee.org/about/get/
> >you can use pdfseparate to extract the page you're interested in,
> >then pdfimages to get the images on that page.
> >
> >with the file you point to, for example, the image from page 86 can
> >be extracted like this:
> >
> >pdfseparate -f 86 -l 86 8*pdf fred
> >pdfimages -f 1 -l 1 -png fred foo-%d
>
> Yes, that gets fig 12-1, but.
>
> I was able to extract pg 39 for fig 7-7 to a file fred.pdf, but the
> pdfimages did not create a foo-1 file.
There's something weird about that document: there are a number of
figures that do not show up in the pdfimages output. Here's what it
lists for the entire document:
page num type    width height color comp bpc enc   interp object ID x-ppi y-ppi  size ratio
-------------------------------------------------------------------------------------------
   1   0 image     459    164 index    1   8 jpx   no       2737  0   257   257 9714B   13%
   1   1 image     459    164 index    1   8 jpx   no       2738  0   257   257 12.9K   18%
   1   2 stencil   394    186 -        1   1 ccitt no       2739  0   301   301  436B  4.8%
   1   3 stencil   394    184 -        1   1 ccitt no       2740  0   301   300  398B  4.4%
   1   4 stencil   387    182 -        1   1 ccitt no       2741  0   301   300  413B  4.7%
   1   5 stencil   387     55 -        1   1 ccitt no       2742  0   301   300   61B  2.3%
   1   6 stencil   387    116 -        1   1 ccitt no       2743  0   301   300  139B  2.5%
   1   7 stencil   387    256 -        1   1 ccitt no       2744  0   301   301  515B  4.2%
   1   8 stencil   387    205 -        1   1 ccitt no       2745  0   301   300  220B  2.2%
   1   9 stencil    96     53 -        1   1 ccitt no       2734  0   301   301   65B   10%
   1  10 stencil    96     63 -        1   1 ccitt no       2735  0   301   300  118B   16%
  52  11 image     670    104 index    1   8 jpx   no        155  0   179   150 7267B   10%
  86  12 image     675    407 index    1   8 jpx   no        259  0   120   120 41.3K   15%
so either pdfimages is busted, or some of the figures in that document
are stored/created in some unusual way. I don't know enough about PDF
internals to have a clue.
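
For what it's worth, a quick way to see which pages carry any embedded
raster images at all is to summarise that listing per page. This is
only a sketch: pages_with_images is a made-up helper name, and it
assumes the listing format shown above (page number in the first
column, header and dashed lines to be skipped).

```shell
# Summarise a pdfimages listing read from stdin: print one line per page
# that holds at least one embedded image, as "PAGE COUNT".
# (pages_with_images is a hypothetical helper, not part of poppler.)
pages_with_images() {
    # Keep only rows whose first field is a page number; this skips the
    # column header and the dashed separator line.
    awk '$1 ~ /^[0-9]+$/ { count[$1]++ }
         END { for (p in count) print p, count[p] }' | sort -n
}
```

Used as "pdfimages -list 802.1AE-2006.pdf | pages_with_images". On the
listing above it would report only pages 1, 52 and 86, so page 39
(fig 7-7) has nothing for pdfimages to extract.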
Sorry I can't be of more help.
Fred
Oh, some more information:
If I open that document with evince and scroll to figure 12-1, I can
right-click the figure, and one of the options is "save image".
If I scroll to fig 7-7 (or pretty much any of the other figures in
there) and right-click the image, I DO NOT get that option.
More evidence that they were created/embedded differently.
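
If those figures are drawn as vector graphics rather than embedded as
raster images (a guess; nothing in this thread confirms it), pdfimages
has nothing to pull out, since it only extracts embedded images. One
workaround is to render the whole page to a bitmap with pdftoppm (also
part of poppler) and crop the figure out of that. A minimal sketch;
render_page and its defaults (300 dpi, output root "page") are my own
assumptions:

```shell
# Render one PDF page to a PNG so a figure can be cropped from it.
# Usage: render_page FILE.pdf PAGE [DPI] [OUTPUT_ROOT]
# (render_page is a hypothetical wrapper around poppler's pdftoppm.)
render_page() {
    pdf=$1
    page=$2
    dpi=${3:-300}      # resolution in dots per inch, default 300
    root=${4:-page}    # prefix of the output file name, default "page"
    # -f/-l select the first and last page, -r sets the resolution,
    # -png selects PNG output instead of the default PPM.
    pdftoppm -f "$page" -l "$page" -r "$dpi" -png "$pdf" "$root"
}
```

For fig 7-7 that would be "render_page 802.1AE-2006.pdf 39", which
should leave a PNG of page 39 whose name starts with "page-"; any
image editor can then crop the figure.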
Fred
--
---- Fred Smith -- fredex(a)fcshome.stoneham.ma.us -----------------------------
"Not everyone who says to me, 'Lord, Lord,' will enter the kingdom of
heaven, but only he who does the will of my Father who is in heaven."
------------------------------ Matthew 7:21 (niv) -----------------------------