42 Commits

Author SHA1 Message Date
kiran-nlmatics
6d1c420233 Merge pull request #79 from nlmatics/correct_pandas
Update Pandas version to be inline with setup.py
v0.1.9
2024-07-26 14:27:20 -04:00
Kiran N
14c6279d1f Update Pandas version to be inline with setup.py 2024-07-26 14:23:57 -04:00
Michael Feil
5c687ecd6b Update setup.py 2024-07-26 10:16:16 -04:00
Ambika Sukla
b3608fe59f Merge pull request #70 from jamesvillarrubia/update-to-nlm-2.9.2_v2
Update to nlm 2.9.2 v2
2024-07-26 10:15:04 -04:00
James Villarrubia
39b1c1bd61 Merge branch 'main' into update-to-nlm-2.9.2_v2 2024-06-22 14:49:14 -04:00
Ambika Sukla
465e6a1a72 Update docker-publish.yml v0.1.8 2024-06-13 06:45:19 -04:00
James Villarrubia
5d8bf820f7 udpates readme reference 2024-06-12 14:18:56 -04:00
James Villarrubia
b60bf9507a Revised tika jar. Jar now includes missing modifications for writeInnerString and the start/end <p> functions. 2024-06-12 14:14:41 -04:00
James Villarrubia
436d0fcca9 Additional header required to get style tags for visual processing. v0.1.7 2024-06-12 09:58:20 -04:00
James Villarrubia
21bc8fe779 no message 2024-06-12 09:58:20 -04:00
Ambika Sukla
0959289d7c Merge pull request #64 from mgl/main
Download encodings during build to run Docker image offline
2024-06-12 09:57:38 -04:00
Ambika Sukla
333d73c023 Merge branch 'main' into main 2024-06-12 09:56:53 -04:00
Jenny Li
2e0f08773a expose port
Signed-off-by: Jenny Li <jenny@joseflegal.com>
2024-06-12 09:55:47 -04:00
James Villarrubia
25b0b39a9a Additional header required to get style tags for visual processing. 2024-06-12 09:31:25 -04:00
James Villarrubia
d890deab29 no message 2024-06-11 19:55:40 -04:00
Marius
2fd6a643f8 Download encodings during build
Fixes the image not running in a network restricted environment
2024-05-29 19:30:43 +02:00
kiran-nlmatics
f7279087ba Merge pull request #62 from nlmatics/tika_config
Add the default config file for tika
2024-05-25 21:33:45 -04:00
Kiran N
782bd252c2 Add the default config file for tika 2024-05-25 21:32:46 -04:00
moveyor
0c7903d48d Update __main__.py
Enable GET method on health check
2024-04-18 15:29:35 -04:00
kiran-nlmatics
373d8a37cf Merge pull request #54 from nlmatics/bbox_table
Correct the BBOX for table blocks
2024-04-17 12:46:12 -04:00
Kiran N
f8bfa8b402 Correct the BBOX for table blocks 2024-04-17 12:42:38 -04:00
Ambika Sukla
cdfc6393e6 made changes to integrate with indexer 2024-03-29 02:01:56 -04:00
Bo Bao
d66935adef [PDF Ingestor] make sure key idx within the range of sorted freq keys 2024-03-26 11:31:03 -04:00
pashpashpash
f707dcab24 health check 2024-02-12 08:10:44 -05:00
pashpashpash
1716e776a0 added health check 2024-02-12 08:10:44 -05:00
Ian Schmitz
554feb13b9 Deploy multi-platform docker image v0.1.6 2024-02-08 21:16:10 -05:00
Ambika Sukla
8f1bcb46ba Merge pull request #8 from erjanmx/fix-readme-typo
Fix readme typo
2024-01-27 15:29:55 -05:00
Erjan K
4f142019ce Fix readme typo 2024-01-27 20:54:34 +01:00
Ambika Sukla
d10178e89c bumpted version to 0.1.5 v0.1.5 2024-01-26 16:14:34 -05:00
Ambika Sukla
0fb90ee9a0 fixed html ingestor break due to new bbox code 2024-01-26 16:13:24 -05:00
Ambika Sukla
e4789a49e4 moved installation steps up in README 2024-01-24 09:30:34 -05:00
Ambika Sukla
a9e297da07 improved documentation 2024-01-24 09:26:14 -05:00
Ambika Sukla
b7691ada14 improved documentation and example notebooks 2024-01-24 09:00:49 -05:00
Ambika Sukla
c2218a1d73 run the server without debug v0.1.4 2024-01-23 20:48:24 -05:00
Ambika Sukla
f97814c85e added tesseract and removed unwanted files v0.1.3 2024-01-23 20:09:51 -05:00
Ambika Sukla
e59831bac5 added bbox, fixed imports and bumped version v0.1.2 2024-01-23 17:55:27 -05:00
Ambika Sukla
62ff19008e added more platforms to git build v0.0.3 2024-01-23 15:58:17 -05:00
Ambika Sukla
c3c26da594 create docker images on releases only v0.0.2 2024-01-23 09:50:43 -05:00
Ambika Sukla
f1872b4809 Create python-publish.yml 2024-01-23 09:30:28 -05:00
Ambika Sukla
e2ab5c3edc added docker instructions and fixed links on README 2024-01-23 09:28:33 -05:00
Ambika Sukla
22fb1fb26e Create docker-publish.yml v0.0.1-test 2024-01-23 08:30:16 -05:00
Ambika Sukla
5c144468a7 first commit to open source code under apache 2.0 license 2024-01-23 03:25:07 -05:00