Add an 'Advanced' tab #17

Closed
opened 2021-08-29 21:51:34 +02:00 by Sulley · 1 comment
Owner

There are many k2pdfopt options that rebook (and therefore ReBook2) did not support. I therefore plan to add an 'Advanced' tab giving access to more specialized k2pdfopt parameters:

-cg <inches>      Minimum column gap width in inches for detecting multiple
                  columns.  Default = 0.1 inches.  Setting this too large
                  will give very poor results for multicolumn files.  See also
                  -cgmax.
                  
-cgmax <inches>   Max allowed gap between columns in inches.  If the gap
                  between two regions exceeds this value, they will not be
                  considered as separate columns.  Default = 1.5.  Use -1 for
                  no limit (disable).  See also -cg.
                  
-cgr <range>      Set column-gap range, 0 - 1.  This is the horizontal range
                  over which k2pdfopt will search for a column gap, as a
                  fraction of the page width.  E.g. -cgr 0.5 will search
                  from 0.25 to 0.75 of the page width for a column gap.
                  Set this to a small value, e.g. 0.05, to only search for
                  column breaks in the middle of the page.  Default = 0.33.

-ch <inches>      Minimum column height in inches for detecting multiple
                  columns.  Default = 1.5 inches.
                  
-comax <range>    Stands for Column Offset Maximum.  The <range> given is as a
                  fraction of the width of a single column, and it specifies
                  how much the column divider can move around and still have
                  the columns considered contiguous.  Set to -1 to revert back
                  to how columns were treated in k2pdfopt v1.34 and before.
                  Default = 0.3.
                  
-crgh <inches>    Set the min height of the blank area that separates regions
                  with different numbers of columns.  Default = 1/72 inch.
                  
-gtc <inches>     Threshold value for detecting column gaps (expert mode).
                  Sets how many of the pixels in the column shaft can be
                  non-white (total height of a line crossing the shaft in
                  inches).  See also -gtr.  Default = .005.
                  
-gtr <inches>     Threshold for detecting gaps between rows (expert mode).
                  This option is commonly adjusted when lines of text in the
                  source file are too close together or slightly overlap and
                  don't have a clean gap between them.  It effectively sets
                  how clean that gap has to be in order for k2pdfopt to
                  consider that there are separate rows of text.  It does
                  this by setting the maximum total black pixels, in inches,
                  on average, that can be in each row of pixels before the gap
                  is no longer considered a gap.  A higher value makes it
                  easier to detect gaps between rows of text.  Too high of a
                  value may inadvertently split figures and other graphics.
                  For documents with nice clean spaces between each row of
                  text, this setting should not need to be adjusted.
                  Default = 0.006.  See also -rsf.
                  
-gtw <inches>     Threshold for detecting word gaps (expert mode).
                  See -gtr.  Default = .0015.

-nl[<pages>] [<leftbound>,<rightbound>]
-nr[<pages>] [<leftbound>,<rightbound>]
                  The source document has notes in the left (-nl) or right
                  (-nr) margins.  Specific pages can be specified for the
                  notes using <pages> (same format as -cbox or -p).  If
                  <leftbound>,<rightbound> are specified, they specify the
                  fraction of the page width where to look for the break
                  between the notes and the main page.  E.g.
                  -nl 0.15,0.25 will look for the boundary between the notes
                  and the text between 15% and 25% of the way across the
                  source page.  Use -nl- to turn off all processing of notes
                  in the margins (default).  Default values for <leftbound>
                  and <rightbound> are 0.05 to 0.35 for -nl and 0.65 to 0.95
                  for -nr.
                  Notes in the margins are treated differently than other
                  "columns" of text.   They will be interspersed with the
                  text in the adjacent column of main text.
                  Note that -nr... or -nl... will also set -cg to 0.05.
                  
-to[-]            Text only output.  Remove figures from output.  Figures are
                  determined empirically as any contiguous region taller than
                  0.75 inches (or you can specify this using the -jf option).
                  Use -to- to turn off (default).

etc...
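
As a rough illustration of how such a tab could feed k2pdfopt, here is a minimal sketch that turns a settings dictionary into command-line arguments. The function name `build_advanced_args` and the shape of the `settings` dictionary are hypothetical and are not taken from ReBook's code; only the flags themselves come from the help text quoted above. A value of `None` (or a missing key) means "leave k2pdfopt's default untouched".

```python
from typing import List, Mapping

# Flags from the k2pdfopt help text quoted above; each takes one numeric value.
NUMERIC_FLAGS = ("-cg", "-cgmax", "-cgr", "-ch", "-comax",
                 "-crgh", "-gtc", "-gtr", "-gtw")


def build_advanced_args(settings: Mapping[str, object]) -> List[str]:
    """Convert a (hypothetical) Advanced-tab state into k2pdfopt arguments."""
    args: List[str] = []

    # Numeric options: emit "<flag> <value>" only when the user set a value.
    for flag in NUMERIC_FLAGS:
        value = settings.get(flag)
        if value is not None:
            args += [flag, f"{float(value):g}"]

    # Margin notes: True enables the flag with default bounds,
    # a (left, right) pair adds explicit bounds as "<left>,<right>".
    for flag in ("-nl", "-nr"):
        value = settings.get(flag)
        if not value:
            continue
        args.append(flag)
        if value is not True:
            left, right = value
            args.append(f"{left:g},{right:g}")

    # Text-only output: True -> "-to", False -> "-to-", None -> omitted.
    text_only = settings.get("-to")
    if text_only is not None:
        args.append("-to" if text_only else "-to-")

    return args


if __name__ == "__main__":
    print(build_advanced_args({"-cg": 0.05, "-nl": (0.15, 0.25), "-to": True}))
```

The resulting list can be appended to the argument list already built for k2pdfopt. Keeping unset options out of the command line avoids overriding k2pdfopt's own defaults.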

Sulley added the enhancement label 2021-08-30 08:08:16 +02:00
Author
Owner

These first options have been added.

Reference: Sulley/ReBook#17