Commit 515ea1a

Graceful handling of inline images
1. Inline images are recognized but not interpreted; they are gracefully ignored during content extraction.
1 parent fecd631 commit 515ea1a
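
The commit message describes recognizing inline images without interpreting them. The code that does this is not shown on this page, so the following is only a minimal sketch of the general idea, assuming the PDF content stream is available as raw bytes. In a PDF content stream an inline image is delimited by the `BI`, `ID`, and `EI` operators. The function name `strip_inline_images` and the regular expression are hypothetical, not taken from this repository, and the pattern is a heuristic: binary image data that happens to contain a whitespace-delimited `EI` would end the match early.

```python
import re

# Heuristic pattern for one inline image: BI <dictionary> ID <data> EI.
# Each operator must be delimited by whitespace (or the stream boundaries).
# Assumption: the image data does not itself contain " EI" followed by whitespace.
_INLINE_IMAGE = re.compile(
    rb"(?<![^\s])BI\s.*?\sID\s.*?\sEI(?=\s|$)",
    re.DOTALL,
)


def strip_inline_images(stream: bytes) -> bytes:
    """Return the content stream with inline image objects removed.

    Hypothetical helper: the image is located by its delimiters only and
    its data is never decoded, so text operators around it stay intact.
    """
    return _INLINE_IMAGE.sub(b"", stream)


if __name__ == "__main__":
    sample = (
        b"BT (Hello) Tj ET\n"
        b"BI /W 2 /H 2 /BPC 8 /CS /G ID \x00\x01\x02\x03 EI\n"
        b"BT (World) Tj ET"
    )
    print(strip_inline_images(sample))
```

A production parser would more likely tokenize the stream and use the image dictionary (width, height, bits per component, filters) to work out where the data ends, rather than searching for `EI`.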

2 files changed

+77
-0
lines changed

test/files/Pratham Sanskaran.pdf

1.09 MB
Binary file not shown.

test/files/Pratham Sanskaran.txt

Lines changed: 77 additions & 0 deletions
@@ -0,0 +1,77 @@
1+
• W1tl tfWi{UI Cfil .~
2+
ml'qftff (fcrei'i<it) 3'1rWT cit ~ ~ 1961 ~ ;-sp. liffi il ~ ?.it 1 ~ ci1
3+
'111«! m-q;p: ~ if CfJli m ~ 3;ri:t ~ ~ CfiT1i Tifii "t!f.1cfi rcn~ ~la;lqC'fl w.,TR :AR
4+
SICfil\1(1(1 ~ <F.! sm ;?.y 'l~-ITB'qq fu~l <ism fclf'lf?f ~ Cfit U"-3l~alT i:f ~ ffi ~ m I
5+
3WWi ~ ~ JJ.q(i r\11SIT fcli ~ .qr fqf~ !ti&:J~C'fl Wm: cit~.~~ fqf~
6+
~ mf~ fi?it trr3 ~ 'l1mi ~ fclf'lf?f M cit ~~3it it~~~~ cffi
7+
~ it m cR ~I :mi: 3WWT ~ 31 ~ ~ ~~ ~ mf~ ~ LITO ~
8+
~ W. ~-:n 3fu: fcW-Tcn ~ ~ rif.i ~ ~ ~ fllR B ~ <12n w ~ cfir
9+
~: urr-1 it m ~ m; ~ ii 31~ ~ :di.~ ~ 312l it ire ::j ~ ~ ~
10+
"<i2n ~ <.l'-:0i cit ~o/.n3TI ~ >lfuRf~ ~ ~ 3-iR ~~ i1 Ttt1 ~ ":fr l'll'1~
11+
~ fcif'tl" ~ ~~ it ~ 31~~ m ~ :m mr~ T.flftt~ ~ ~ ?.fr ~ ~ ~
12+
~ ~ <F.! ~ fcv:IT ::i1Rl l>..ff I
13+
~ 'i:fT ~ ~ TJ<.IT fcf ~ ~· ~ ~ tfT3 3W: fclf~ ~!&;1r.!M! W-n-\ ciT~
14+
-uft ?.fr ~ ~ it ~ ~ ~ ~ fbl-.:lR ~ ~ ~ ~ ~ 3";:{ fH~i{i if ~ T.R
15+
R:UlfOI"'fl ~ <:f.r ~ni: : ~ qqr rrii c;rc: ·~ ~ .Jmf1 m am· ~ mcnro ~· i1
16+
fllllf014.1 irlfr ~ cr: W.fcf fur;.w: ctl1 ~ 3'R'Ri ~ ~ :;11"'<1~(11 31R f~..fl TTI"c <l~i cf.J
17+
m ~ ~ m1
18+
3WWr ~ ~ ~ Cf.B ir ~ 31f~~ if >IT~~~~ m ~141<.1<-il i:r
19+
WT~ ~ men, ~ itt 31 31f~ <if '4'f >n~ ~ =Tf 'qp::i ~ fa.f'l.T?r
20+
fct¥Jfci'n1Mqi ~ 41<}4*l11 B BMR1d ~ ~ % I ~ ~ ~ ~ ~ CfTT ir
21+
M'~ 31f~ it ~ m~ ~ 1 -~ ~ .3WWT · 168 ~ 311~~ ~ ftRT tiTG
22+
Wm: Cfl\ ¥-1 ~ ~ if 142 ~'l1fl'.n 31f~, 1963 cit 'tiro 5 ( 1 ) ~ ~~ 'J1m'f ~
23+
~ ifi >nf~ it 'l1mi ifi ~ i:t >lCfilf~l(i m ~ % 1 31(1: ~ il 31 31f~ it:
24+
mf'tfcFf f~--£1 ~ it ~ % 3fR qtt ~ .3fu ~ ~ % ~ 31 ~~ ifi ~ qr~
25+
<F.! t I
26+
~ :m~ it ~ fcifw. ~lci;lq<.'i~ <tr ~ ~ ,~ ~ ~ ~ m
27+
~ fffir_ qq:,Jf~k1 fcf;m ~ ® ~ I ·31Tlfrrr <F.! trf. fct¥Jm % fcli f3H't:1 'lif fa.f~~ ~.
28+
fcrf'iTcf. <:>qqfW"l ~~ fcif~ 'Cf&f trr0.:f if ~ % j.1 ~ if; fc;:rQ, 7:lb ~ mrTr fff.;;
29+
m-rrr i ~-m qfrj *' ~ ~ tJ fcf. w cW1f c8 ..n ~~Jll m~r.,4t:l! ~ rr:~ q1ai CfT ~
30+
t.n iPn ~ ir (54 ~~ Ii >l'f=H ~r&:lq«fl ~ ~ m CfP.i irn 1 f.t.--q fuP6 ~
31+
3-li~· C'fi1 "f0ITf1J -.:j o.:ft ~ ·i:J ~.fit ~ ..n ~ ~ if iT Tit ~~ Cf;'i W
32+
~ 'W!Ti
33+
~ f-14 ~!1"41 ~,~j 3fu: q;:_p•-lf0.£ii <iJ Bfa:JIMct fcFlrr Tf'n ~ 31Cf.T ·~ if Cffi
34+
31~ fu7..n Tfl11 ~ !~A~ ~ ~~~ 31f~f.p:p:lf it )f'fcRl' ~ 7.ff ~ % 1 34~-il 31~ ~ qw, m
35+
...-; o.,...J .::; .......
36+
>r:r& crW:r Rlff ·o/-IT ·~ ~ m ~ 31f~ -q ~ TTlfT % : ~ m~ ir
37+
Cfl€\'T-~· cf.rWf. it ~B{ ~ 'l:f1 ~ TfQ. % if 31m ~ f~~~ if f'lf?f 'l1TEn311 ~ ~wn
38+
ifi ~ lT(f i:r: 311~ '1\ ~ R1 it >r:r& m rr:r ~ TN. % m ~ B, 31 ~
39+
~~ it j.1 ~ ~ 411 W 'lit fcfv.:n 7Jfl WRIT % I ~ 31ltm tR CfiiJ' -CfltY ~ ~
40+
"'· .
41+
'4f RQ TfQ, f.' 0TI ~ ~ ;$if if ~ Tc: %I 7:lb ~ ~J."lfM< 'if T!i t fcF. ~ CfT
42+
~ ~ · ~ Cf.@. 1im ::: -ir SITQ f.f f?.;:fr ~ ~ f~. iT % ~ ~ mm GR
43+
~I
44+
il9 • 3WWr tliT w:~m ~ ~ ~ % fcfi ~ m m ~ ~ ~ ~· <.t>1 >rlF1
45+
~~ ~ ~ ~ ~ ~. ~ ~ ~ 1 awW-r ~ ~ ~ ~ m <.t>1 ~
46+
~ tliT\UT ~an ~ <.it fcfi ~ ~ 'qfqf ~ f'l1?f 'qfqf tliT ~ ~ 1 ~ 'qfqf ~ ~ ~
47+
~ ~ ~ ~ ~ ~ m ~~* ~ ~ 3Pl 'qfqf~~ ~ "GPi ~en&%'
48+
3WWT ~ 3llHI ~I ~ >l'liR ~. ~. CffiUl ~ ~ ~ ~ ~ ~ & ~
49+
~~I fqf~ fcfT.nU ~ ~ ~~ 3RR ~~*~~<fiB~~~ it
50+
~ ~ ~ m ~ ~ ~ ·~ fufl!. ~ <.t>1 3Pl 'qfqf~ ~ ~ ~ ~ ~ ~ 1 m
51+
~% fcf; 3lJtq ~ ~ ~ ~ ~ ~ ~·tliT\UT ~~ 1 it%~ &ffi ~
52+
~ ~ 1 ~ ~ ~ ~ ~ 'qfqf~ ~ fufl!. ~ ~ ~ ~ ~% ~ ~ m ~ ~ ~ ~
53+
~~ ~~~~m~m~ 'qfqf~~m~1 ~~m~~
54+
~ 351 ~ ~ ~ wij ~ ~ ~ %I ~: ~ fen~ ~I<IZJCiMl awWr ~
55+
~. ~ ~ ~'l:JCJ ~ ~ CfiT ~· ~ ~ ~ 'qfqf it ~ ~ ~ ~ ~
56+
cqmr <.t>1 a:rr~ 'qfqf~ CfiT • -m %1T ~ 1 1R ~ CfiT "ffi1cfi ~ ~ wr ~ %
57+
RfWOT~ ciT ~an ~ ~ \1ffll {fcfi ~'l:JCJ m ~ ~ ~ ~ ~ <:IT en ~ 'l1l1"n <:IT
58+
~q~u~;;ff ~ <n ~ <.it 3Pl 'qfqf~ ~ ~ it m ~ ~ :31~ ~ ~ ~ m~ ~
59+
~ 'qfqf~ ~ ~ ~ ~ ~ ~ 31fuR1 ~ 'lWn f~ ~ ~
60+
~4~UJR1 ~ ~ ~ it :31~ ~ ~ ~ ~fi1R1~ ~ ~ ~ <:rr en ~ ~
61+
~ ~ ~8.1 ~ ~ ~ :31~ ~ _mcfiR ~ ~-~ 1R ~· %<:fT ~I
62+
3WWT ~ ~ ~ ~ ~ fcp;ft 1Q, ~ ~ ~ CfiT W-lrn ~ %<:fT ~ I
63+
-m ~ ~% ~~~~~~~31ft~~~ it~~
64+
~ 'qfqf en ~ W-lfu ~ ~ ~~'i:WUT <.fit 'qJlfl it ~ ~ {fcfi f'l1?f men ~ 1 ~~ ~
65+
<.t>1 ~ ~ fufl!. ~ ~ CfiT m ~ ~ ~ 1 rcn~ en~~~~
66+
~~ ~ ~-~ ~ ~ it ~ l 31(1: ~ en w~ :31~ crr#f ~ m
67+
~ ~ ~ ~ ~ I ~ CfiRUT ~ % ~-~ 3WWT ~ ~'i:WUT <.it 'lWn ~ ~
68+
~ ~0
69+
~ ~ 3Pl ~ CfiT m ~ ~ ~~
70+
3WWT ~ ~ ~ ~ ~ % fiR ~· 'qfqf ~ ~ CfiT m ~m ~llll<1lff
71+
"(;~ ~ ~ %<:fT ~ ~ ~ ~ ~ 4llll!Cll-.::iJ ~ ~ ~ ~ ~ ~ ~~I
72+
3WWT CfiT fcfsarn ~ % a:IT~ ~ mf~ qrc it ~ ~ ~ cmur ~ ~ ciT
73+
:31~ ~,ll,w•lY ~ ~ it m m ~ ~ fuR m ~ 1 3WWT <it ~ ·'lfr 31TW ~
74+
% ~ ~ ~ ~ it fqf~ ~ ·~ ~ ~ 'qfqf CfiT m ~ -m ~ 3fu:
75+
~~. ~. ma.:rcfi m fc:Rn2ff ~~it~ fqf~ <fiPf :31~ 'TcR ~
76+
~m~~~~
77+
720 •
