Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
1 change: 1 addition & 0 deletions .gitattributes
Original file line number Diff line number Diff line change
Expand Up @@ -10,4 +10,5 @@
*.bcfks binary
*.crt binary
*.p12 binary
*.ttf binary
*.txt text=auto
3 changes: 3 additions & 0 deletions CHANGELOG.md
Original file line number Diff line number Diff line change
Expand Up @@ -12,6 +12,9 @@ The format is based on [Keep a Changelog](https://keepachangelog.com/en/1.0.0/),
- Bump Apache Lucene to 9.12.2 ([#18574](https://github.com/opensearch-project/OpenSearch/pull/18574))
- Bump `commons-beanutils:commons-beanutils` from 1.9.4 to 1.11.0 ([#18401](https://github.com/opensearch-project/OpenSearch/issues/18401))
- Bump `org.apache.poi` version from 5.2.5 to 5.4.1 in /plugins/ingest-attachment ([#17887](https://github.com/opensearch-project/OpenSearch/pull/17887))
- Bump `org.apache.tika` from 2.9.2 to 3.2.2 ([#19242](https://github.com/opensearch-project/OpenSearch/pull/19242))
- Bump `org.apache.commons:commons-compress` from 1.26.1 to 1.28.0 ([#19125](https://github.com/opensearch-project/OpenSearch/pull/19242))
- Bump `org.apache.commons:commonscodec` from 1.16.1 to 1.18.0 ([#19125](https://github.com/opensearch-project/OpenSearch/pull/19242))

### Deprecated

Expand Down
1 change: 0 additions & 1 deletion client/rest/licenses/commons-codec-1.16.1.jar.sha1

This file was deleted.

1 change: 1 addition & 0 deletions client/rest/licenses/commons-codec-1.18.0.jar.sha1
Original file line number Diff line number Diff line change
@@ -0,0 +1 @@
ee45d1cf6ec2cc2b809ff04b4dc7aec858e0df8f
1 change: 0 additions & 1 deletion client/sniffer/licenses/commons-codec-1.16.1.jar.sha1

This file was deleted.

1 change: 1 addition & 0 deletions client/sniffer/licenses/commons-codec-1.18.0.jar.sha1
Original file line number Diff line number Diff line change
@@ -0,0 +1 @@
ee45d1cf6ec2cc2b809ff04b4dc7aec858e0df8f
7 changes: 6 additions & 1 deletion distribution/tools/plugin-cli/build.gradle
Original file line number Diff line number Diff line change
Expand Up @@ -80,5 +80,10 @@ thirdPartyAudit.ignoreMissingClasses(
'org.tukaani.xz.XZOutputStream',
'org.apache.commons.codec.digest.PureJavaCrc32C',
'org.apache.commons.codec.digest.XXHash32',
'org.apache.commons.lang3.reflect.FieldUtils'
'org.apache.commons.lang3.reflect.FieldUtils',
'org.apache.commons.lang3.ArrayFill',
'org.apache.commons.lang3.ArrayUtils',
'org.apache.commons.lang3.StringUtils',
'org.apache.commons.lang3.SystemProperties',
'org.apache.commons.lang3.function.Suppliers'
)

This file was deleted.

Original file line number Diff line number Diff line change
@@ -0,0 +1 @@
e482f2c7a88dac3c497e96aa420b6a769f59c8d7
6 changes: 3 additions & 3 deletions gradle/libs.versions.toml
Original file line number Diff line number Diff line change
Expand Up @@ -46,9 +46,9 @@ httpclient = "4.5.14"
httpcore = "4.4.16"
httpasyncclient = "4.1.5"
commonslogging = "1.2"
commonscodec = "1.16.1"
commonslang = "3.14.0"
commonscompress = "1.26.1"
commonscodec = "1.18.0"
commonslang = "3.18.0"
commonscompress = "1.28.0"
commonsio = "2.16.0"
# plugin dependencies
aws = "2.20.86"
Expand Down
1 change: 0 additions & 1 deletion libs/arrow-spi/licenses/commons-codec-1.16.1.jar.sha1

This file was deleted.

1 change: 1 addition & 0 deletions libs/arrow-spi/licenses/commons-codec-1.18.0.jar.sha1
Original file line number Diff line number Diff line change
@@ -0,0 +1 @@
ee45d1cf6ec2cc2b809ff04b4dc7aec858e0df8f

This file was deleted.

Original file line number Diff line number Diff line change
@@ -0,0 +1 @@
ee45d1cf6ec2cc2b809ff04b4dc7aec858e0df8f

This file was deleted.

Original file line number Diff line number Diff line change
@@ -0,0 +1 @@
ee45d1cf6ec2cc2b809ff04b4dc7aec858e0df8f

This file was deleted.

Original file line number Diff line number Diff line change
@@ -0,0 +1 @@
ee45d1cf6ec2cc2b809ff04b4dc7aec858e0df8f

This file was deleted.

Original file line number Diff line number Diff line change
@@ -0,0 +1 @@
ee45d1cf6ec2cc2b809ff04b4dc7aec858e0df8f

This file was deleted.

Original file line number Diff line number Diff line change
@@ -0,0 +1 @@
ee45d1cf6ec2cc2b809ff04b4dc7aec858e0df8f
8 changes: 5 additions & 3 deletions plugins/ingest-attachment/build.gradle
Original file line number Diff line number Diff line change
Expand Up @@ -38,8 +38,8 @@ opensearchplugin {
}

versions << [
'tika' : '2.9.2',
'pdfbox': '2.0.31',
'tika' : '3.2.2',
'pdfbox': '3.0.5',
'poi' : '5.4.1',
'mime4j': '0.8.11'
]
Expand Down Expand Up @@ -75,10 +75,11 @@ dependencies {

// external parser libraries
// HTML
api 'org.ccil.cowan.tagsoup:tagsoup:1.2.1'
api 'org.jsoup:jsoup:1.20.1'
// Adobe PDF
api "org.apache.pdfbox:pdfbox:${versions.pdfbox}"
api "org.apache.pdfbox:fontbox:${versions.pdfbox}"
api "org.apache.pdfbox:pdfbox-io:${versions.pdfbox}"
api "org.apache.pdfbox:jempbox:1.8.17"
api "commons-logging:commons-logging:${versions.commonslogging}"
api "org.bouncycastle:bcmail-jdk18on:${versions.bouncycastle}"
Expand Down Expand Up @@ -124,6 +125,7 @@ forbiddenPatterns {
exclude '**/*.pdf'
exclude '**/*.epub'
exclude '**/*.vsdx'
exclude '**/*.ttf'
}

thirdPartyAudit {
Expand Down
93 changes: 93 additions & 0 deletions plugins/ingest-attachment/licenses/Roboto-OFL.txt
Original file line number Diff line number Diff line change
@@ -0,0 +1,93 @@
Copyright 2011 The Roboto Project Authors (https://github.com/googlefonts/roboto-classic)

This Font Software is licensed under the SIL Open Font License, Version 1.1.
This license is copied below, and is also available with a FAQ at:
https://openfontlicense.org


-----------------------------------------------------------
SIL OPEN FONT LICENSE Version 1.1 - 26 February 2007
-----------------------------------------------------------

PREAMBLE
The goals of the Open Font License (OFL) are to stimulate worldwide
development of collaborative font projects, to support the font creation
efforts of academic and linguistic communities, and to provide a free and
open framework in which fonts may be shared and improved in partnership
with others.

The OFL allows the licensed fonts to be used, studied, modified and
redistributed freely as long as they are not sold by themselves. The
fonts, including any derivative works, can be bundled, embedded,
redistributed and/or sold with any software provided that any reserved
names are not used by derivative works. The fonts and derivatives,
however, cannot be released under any other type of license. The
requirement for fonts to remain under this license does not apply
to any document created using the fonts or their derivatives.

DEFINITIONS
"Font Software" refers to the set of files released by the Copyright
Holder(s) under this license and clearly marked as such. This may
include source files, build scripts and documentation.

"Reserved Font Name" refers to any names specified as such after the
copyright statement(s).

"Original Version" refers to the collection of Font Software components as
distributed by the Copyright Holder(s).

"Modified Version" refers to any derivative made by adding to, deleting,
or substituting -- in part or in whole -- any of the components of the
Original Version, by changing formats or by porting the Font Software to a
new environment.

"Author" refers to any designer, engineer, programmer, technical
writer or other person who contributed to the Font Software.

PERMISSION & CONDITIONS
Permission is hereby granted, free of charge, to any person obtaining
a copy of the Font Software, to use, study, copy, merge, embed, modify,
redistribute, and sell modified and unmodified copies of the Font
Software, subject to the following conditions:

1) Neither the Font Software nor any of its individual components,
in Original or Modified Versions, may be sold by itself.

2) Original or Modified Versions of the Font Software may be bundled,
redistributed and/or sold with any software, provided that each copy
contains the above copyright notice and this license. These can be
included either as stand-alone text files, human-readable headers or
in the appropriate machine-readable metadata fields within text or
binary files as long as those fields can be easily viewed by the user.

3) No Modified Version of the Font Software may use the Reserved Font
Name(s) unless explicit written permission is granted by the corresponding
Copyright Holder. This restriction only applies to the primary font name as
presented to the users.

4) The name(s) of the Copyright Holder(s) or the Author(s) of the Font
Software shall not be used to promote, endorse or advertise any
Modified Version, except to acknowledge the contribution(s) of the
Copyright Holder(s) and the Author(s) or with their explicit written
permission.

5) The Font Software, modified or unmodified, in part or in whole,
must be distributed entirely under this license, and must not be
distributed under any other license. The requirement for fonts to
remain under this license does not apply to any document created
using the Font Software.

TERMINATION
This license becomes null and void if any of the above conditions are
not met.

DISCLAIMER
THE FONT SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND,
EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO ANY WARRANTIES OF
MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT
OF COPYRIGHT, PATENT, TRADEMARK, OR OTHER RIGHT. IN NO EVENT SHALL THE
COPYRIGHT HOLDER BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY,
INCLUDING ANY GENERAL, SPECIAL, INDIRECT, INCIDENTAL, OR CONSEQUENTIAL
DAMAGES, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING
FROM, OUT OF THE USE OR INABILITY TO USE THE FONT SOFTWARE OR FROM
OTHER DEALINGS IN THE FONT SOFTWARE.

This file was deleted.

Original file line number Diff line number Diff line change
@@ -0,0 +1 @@
ee45d1cf6ec2cc2b809ff04b4dc7aec858e0df8f

This file was deleted.

Original file line number Diff line number Diff line change
@@ -0,0 +1 @@
e482f2c7a88dac3c497e96aa420b6a769f59c8d7

This file was deleted.

Original file line number Diff line number Diff line change
@@ -0,0 +1 @@
fb14946f0e39748a6571de0635acbe44e7885491

This file was deleted.

Original file line number Diff line number Diff line change
@@ -0,0 +1 @@
b4a068e1dba2b9832a108cdf6e9a3249680e3ce8
1 change: 1 addition & 0 deletions plugins/ingest-attachment/licenses/jsoup-1.20.1.jar.sha1
Original file line number Diff line number Diff line change
@@ -0,0 +1 @@
769377896610be1736f8d6d51fc52a6042d1ce82
21 changes: 21 additions & 0 deletions plugins/ingest-attachment/licenses/jsoup-LICENSE.txt
Original file line number Diff line number Diff line change
@@ -0,0 +1,21 @@
The MIT License

Copyright (c) 2009-2025 Jonathan Hedley <https://jsoup.org/>

Permission is hereby granted, free of charge, to any person obtaining a copy
of this software and associated documentation files (the "Software"), to deal
in the Software without restriction, including without limitation the rights
to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
copies of the Software, and to permit persons to whom the Software is
furnished to do so, subject to the following conditions:

The above copyright notice and this permission notice shall be included in all
copies or substantial portions of the Software.

THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
SOFTWARE.

This file was deleted.

1 change: 1 addition & 0 deletions plugins/ingest-attachment/licenses/pdfbox-3.0.5.jar.sha1
Original file line number Diff line number Diff line change
@@ -0,0 +1 @@
c34109061c3a0d85d871d9edc469ac0682f81856
Original file line number Diff line number Diff line change
@@ -0,0 +1 @@
402151a8d1aa427ea879cc7160e9227e9f5088ba
Loading
Loading