eland

mirror of https://github.com/elastic/eland.git synced 2025-07-11 00:02:14 +08:00

Author	SHA1	Message	Date
David Kyle	ee4d701aa4	Upgrade transformers to 4.47 (#752 ) The upgrade fixes a crash tracing the baai/bge-m3 model	2025-02-12 17:30:45 +00:00
Quentin Pradet	c4ac64e3a0	Allow scikit-learn 1.5 to address CVE-2024-5206 (#729 )	2025-02-12 14:34:13 +04:00
Bart Broere	75c57b0775	Support Pandas 2 (#742 ) * Fix test setup to match pandas 2.0 demands * Use the now deprecated _append method (Better solution might exist) * Deal with numeric_only being removed in metrics test * Skip mad metric for other pandas versions * Account for differences between pandas versions in describe methods * Run black * Check Pandas version first * Mirror behaviour of installed Pandas version when running value_counts * Allow passing arguments to the individual asserters * Fix for method _construct_axes_from_arguments no longer existing * Skip mad metric if it does not exist * Account for pandas 2.0 timestamp default behaviour * Deal with empty vs other inferred data types * Account for default datetime precision change * Run Black * Solution for differences in inferred_type only * Fix csv and json issues * Skip two doctests * Passing a set as indexer is no longer allowed * Don't validate output where it differs between Pandas versions in the environment * Update test matrix and packaging metadata * Update version of Python in the docs * Update Python version in demo notebook * Match noxfile * Symmetry * Fix trailing comma in JSON * Revert some changes in setup.py to fix building the documentation * Revert "Revert some changes in setup.py to fix building the documentation" This reverts commit ea9879753129d8d8390b3cbbce57155a8b4fb346. * Use PANDAS_VERSION from eland.common * Still skip the doctest, but make the output pandas 2 instead of 1 * Still skip doctest, but switch to pandas 2 output * Prepare for pandas 3 * Reference the right column * Ignore output in tests but switch to pandas 2 output * Add line comment about NBVAL_IGNORE_OUTPUT * Restore missing line and add stderr cell * Use non-private method instead * Fix indentation and parameter issues * If index is not specified, and pandas 1 is present, set it to True From pandas 2 and upwards, index is set to None by default * Run black * Newer version of black might have different opinions? * Add line comment * Remove unused import * Add reason for ignore statement * Add reason for skip --------- Co-authored-by: Quentin Pradet <quentin.pradet@elastic.co>	2025-02-04 17:43:43 +04:00
Bart Broere	9b5badb941	Drop Python 3.8 support and introduce Python 3.12 CI/CD (#743 )	2025-01-22 21:55:57 +04:00
David Kyle	5253501704	Upgrade PyTorch to version 2.3.1 (#718 ) Upgrades the PyTorch, transformers and sentence transformer requirements. Elasticsearch has upgraded to PyTorch to 2.3.1 in 8.16 and 8.15.2. For compatibility reasons Eland will refuse to upload to an Elasticsearch cluster that has is using an earlier version of PyTorch.	2024-09-30 10:22:02 +01:00
Quentin Pradet	116416b3e8	Stop duplicating requirements (#691 )	2024-05-14 15:59:39 +04:00
David Kyle	c16e36c051	Add Python 3.11 to support matrix (#681 )	2024-03-27 10:34:35 +00:00
David Kyle	ae0bba34c6	Upgrade torch to 2.1.2 (#671 ) Compatible with Elasticsearch 8.13 where the same upgrade has been made	2024-03-26 10:06:50 +00:00
Quentin Pradet	1190364abb	Release 8.12.0	2024-01-19 12:42:45 +04:00
Quentin Pradet	af26897313	Bumpy numpy and shap (#636 )	2023-11-21 13:17:53 +01:00
Quentin Pradet	b8a7b60c03	Stop mentioning Python 3.7 and Pandas 1.13 are supported (#612 )	2023-10-04 10:56:51 +02:00
Quentin Pradet	352e31ed14	Add Buildkite pipeline to push Docker image (#613 ) * Add Buildkite pipeline to push Docker image * Fix lint * Fix Read the Docs build * Replace distutils with packaging	2023-10-03 14:39:54 +02:00
Quentin Pradet	9d7c042bdb	Bump transformers to fix private model support (#611 )	2023-09-26 14:54:23 +02:00
Youhei Sakurai	4cf92fd9b7	Make eland_import_hub_model easier to find on Windows. (#559 )	2023-07-20 09:24:35 +01:00
Valeriy Khakhutskyy	f38de0ed05	Fix failing unit tests (#558 ) I updated the tree serialization format for the new scikit learn versions. I also updated the minimum requirement of scikit learn to 1.3 to ensure compatibility. Fixes #555	2023-07-10 15:15:58 +02:00
David Kyle	7820a31256	Limit NumPy to a range of versions and note why (#540 )	2023-05-22 10:47:06 +01:00
David Kyle	36bbbe0bdb	Upgrade torch to 1.13.1 and check the cluster version before uploading a NLP model. (#522 ) PyTorch models traced in version 1.13 of PyTorch cannot be evaluated in version 1.9 or earlier. With this upgrade Eland becomes incompatible with pre 8.7 Elasticsearch and will refuse to upload a model to the cluster. In this scenario either upgrade Elasticsearch or use an earlier version of Eland.	2023-05-19 16:29:38 +01:00
David Kyle	b507bb6d6c	Restrict NumPy and Pandas versions (#539 ) Shap is incompatible with NumPy 1.24 due to a deprecated usage becoming an error. There is no fix in Shap yet so an earlier version of NumPy must be used. Pandas 2.0 was recently released we will continue to use the latest 1.5 release to avoid any incompatibilities.	2023-05-19 16:04:33 +01:00
Valeriy Khakhutskyy	2ea96322b3	Update to latest ES versions and fix unit tests (#512 ) Update the test matrix to the latest Elasticsearch versions and fix the broken unit tests on the CI.	2023-01-31 20:55:29 +01:00
David Kyle	0eb36faa5b	Restrict PyTorch version not to be more advanced than that used in Elasticsearch (#479 ) Elasticsearch uses v1.11 of PyTorch. Models created with the latest PyTorch release (v1.12) are not compatible with v1.11. This pins the PyTorch version to 1.11 to prevent the incompatibility. The version of the Elasticsearch Python client is now required to be >= Eland. All users of Eland for importing NLP models should upgrade.	2022-07-07 14:56:42 +01:00
David Kyle	8448b3ba4e	Bump minimum PyTorch version to 1.11	2022-06-21 07:43:43 -05:00
David Olaru	b5ea1cf228	Align dependencies between requirement files and setup.py (#460 )	2022-04-27 07:14:49 -05:00
Seth Michael Larson	abd05df50b	Release 8.0.0	2022-02-10 14:29:54 -06:00
Seth Michael Larson	109387184a	Support the v8.0 Elasticsearch client	2021-12-09 15:01:26 -06:00
Josh Devins	1ffbe002c4	Upgrade PyTorch dependencies to latest In preparation for an 8.0 release, this updates PyTorch NLP dependencies to more recent and latest minor versions. Amongst other things, this introduces a fix from transformers that is helpful for text embedding tasks with certain DPR models. See: https://github.com/huggingface/transformers/issues/13670 Co-authored-by: Seth Michael Larson <seth.larson@elastic.co>	2021-12-06 09:05:54 -06:00
Josh Devins	df51f8af07	Document how to install transitive binary dependencies, add repo Dockerfile Co-authored-by: Seth Michael Larson <seth.larson@elastic.co>	2021-10-28 12:05:39 -05:00
Benjamin Trent	d39c1cd784	[ML] Make eland_import_hub_model an installable script	2021-10-19 11:29:58 -05:00
Josh Devins	014943d3b8	Add initial implementation of PyTorch ML models	2021-10-06 08:44:40 -05:00
Seth Michael Larson	b0c8434c06	Release 7.14.0b1	2021-08-09 09:11:57 -05:00
Seth Michael Larson	54b497ed9a	Update supported versions of Python, pandas, and Elasticsearch	2021-08-04 13:21:17 -05:00
Seth Michael Larson	c74fccbd74	Drop support for Python 3.6, pandas<1.2	2021-07-27 14:43:03 -05:00
P. Sai Vinay	193bcb73ef	Add support for Pandas v1.3 and LightGBM v3.x	2021-07-27 11:01:35 -05:00
Seth Michael Larson	31760fe02c	Release 7.10.0b1	2020-10-29 13:43:34 -05:00
Seth Michael Larson	bd7956ea72	Support typed 'elasticsearch-py' and add 'py.typed'	2020-10-20 16:26:58 -05:00
Seth Michael Larson	05a24cbe0b	Add isort, rename Nox session to 'format'	2020-10-15 17:11:29 -05:00
Seth Michael Larson	661b33dd0a	Update and rearrange documentation	2020-08-17 15:55:06 -05:00
Benjamin Trent	6ee282e19f	[ML] Add support for LGBMRegressor models	2020-08-11 07:42:59 -05:00
Seth Michael Larson	ceacf759c3	Add long Apache-2.0 license header to all files	2020-07-08 15:10:43 -05:00
Seth Michael Larson	6ca41585e9	Upgrade to elasticsearch-py v7.7	2020-05-14 10:07:10 -05:00
Seth Michael Larson	3d81def5cc	Add support for xgboost v1	2020-04-29 13:06:35 -05:00
Seth Michael Larson	15a1977dcf	Add agg compatibility logic to Field class	2020-04-27 15:16:48 -05:00
Seth Michael Larson	7946eb4daa	Add an enforce license headers	2020-04-25 16:26:58 -05:00
Seth Michael Larson	e71420c883	Release 7.6.0a5	2020-04-14 11:07:32 -05:00
Seth Michael Larson	448770df78	Restrict public API, update license header	2020-04-14 07:31:23 -05:00
Daniel Mesejo-León	e27a508c59	Update supported Pandas to v1.0	2020-03-27 12:21:15 -05:00
Seth Michael Larson	0c1d7222fe	Drop support for Python 3.5, add Black	2020-03-27 07:56:28 -05:00
Seth Michael Larson	e9a5180dac	Add python_requires to setup.py	2020-03-23 08:35:07 -05:00
stevedodson	409cb043c8	Refactoring of plotting + fixes for multiple charts (#117 ) * Refactoring of plotting + fixes for multiple charts Updates to plotting inline with pandas 0.25.3 Enables plotting of multiple histograms on the same figure. * Fix to setup.py to allow submodules + reformat of code and better Series.hist docs	2020-01-29 07:07:56 +00:00
stevedodson	1914644f93	Improve docs (#113 ) * Adding more examples * Adding more examples to README.md + pypi first page. * Updated README.md	2020-01-13 15:32:41 +00:00
stevedodson	d7207bab3b	7.5.1a2 (#110 ) * Updating README.md * New version * Fixing description for pypi	2020-01-10 15:40:15 +00:00

1 2

60 Commits