eland

mirror of https://github.com/elastic/eland.git synced 2025-07-11 00:02:14 +08:00

Author	SHA1	Message	Date
Quentin Pradet	41db37246f	Release 8.11.0	2023-11-08 11:51:14 +01:00
Valeriy Khakhutskyy	6cecb454e3	[ML] Better memory estimation for NLP models (#568) This PR adds the ability to estimate per-deployment and per-allocation memory usage of NLP transformer models. It uses torch.profiler and logs the peak memory usage during inference. This information is then used in Elasticsearch to provision models with sufficient memory (elastic/elasticsearch#98874).	2023-11-06 12:18:20 +01:00
Bart Broere	28e6d92430	Stream writes in to_csv() Co-authored-by: P. Sai Vinay <pvinay1998@gmail.com>	2023-11-06 11:39:31 +01:00
David Kyle	5b3a83e7f2	[NLP] Support E5 small multi-lingual (#625) Although E5 small is a BERT-based model, it takes 2 parameters to forward, not 4. Use the tokenizer type to decide the number of parameters.	2023-10-31 17:49:43 +00:00
David Kyle	ab6e44f430	[NLP] Tests for NLP model configurations (#623) Add tests for generated Elasticsearch model configurations	2023-10-19 12:39:57 +01:00
Quentin Pradet	6a4fd511cc	Release 8.10.1 (#620)	2023-10-11 12:56:24 +02:00
Quentin Pradet	c6ce4b2c46	Fix direct usage of TransformerModel (#619)	2023-10-11 11:56:14 +02:00
Bart Broere	48e290a927	Prepare for deprecation of is_datetime_or_timedelta_dtype in Pandas 2.0 (#592)	2023-10-10 19:37:13 +01:00
Quentin Pradet	bb0c111a68	Release Eland 8.10.0	2023-10-09 11:49:12 +02:00
Quentin Pradet	352e31ed14	Add Buildkite pipeline to push Docker image (#613) * Add Buildkite pipeline to push Docker image * Fix lint * Fix Read the Docs build * Replace distutils with packaging	2023-10-03 14:39:54 +02:00
Quentin Pradet	566bb9e990	Allow importing private HuggingFace models (#608)	2023-09-25 15:10:58 +02:00
Jonathan Buttner	a8b76c390f	Setting chunk size to 1mb (#605)	2023-09-20 11:40:11 -04:00
Bart Broere	12200039f5	Fix iteritems deprecation (#593)	2023-09-19 12:00:32 +02:00
David Kyle	301cda8d69	Error measuring embedding size for some DPR models (#573) Fixes an error unpacking a tuple that contains a single element.	2023-09-19 10:44:15 +01:00
Bart Broere	5c5ef63a69	Use the workaround if we can't determine the server's version (#581)	2023-09-15 15:29:36 +04:00
Enrico Zimuel	ac8c7c341e	Readded author info	2023-08-24 11:18:17 +02:00
Enrico Zimuel	ebdebdf16f	Prep for 8.9.0 release	2023-08-24 11:11:48 +02:00
Enrico Zimuel	932092c0e5	Fixed test for mean using ES 8.9.0	2023-08-24 10:46:14 +02:00
Josh Devins	f26fb8a430	Simplify embedding model support and loading (#569) We were attempting to load SentenceTransformers by looking at the model prefix; however, SentenceTransformers can also be loaded from other orgs in the model hub, as well as from local disk. This prefix checking failed in those two cases. To simplify the loading logic and deciding which wrapper to use, we’ve removed support for text_embedding tasks to load a plain Transformer. We now only support DPR embedding models and SentenceTransformer embedding models. If you try to load a plain Transformer model, it will be loaded by SentenceTransformers and a mean pooling layer will automatically be added by the SentenceTransformer library. Since we no longer automatically support non-DPR and non-SentenceTransformers, we should include example code somewhere for how to load a custom model without DPR or SentenceTransformers. See: https://github.com/UKPLab/sentence-transformers/blob/v2.2.2/sentence_transformers/SentenceTransformer.py#L801 Resolves #531	2023-07-31 18:18:46 +02:00
Youhei Sakurai	4cf92fd9b7	Make eland_import_hub_model easier to find on Windows. (#559)	2023-07-20 09:24:35 +01:00
Valeriy Khakhutskyy	77781b90ff	[ML] Update trained model inference endpoint (#556) Infer trained model deployment API has been deprecated, so I changed the code to use the new one.	2023-07-11 10:55:11 +02:00
Valeriy Khakhutskyy	f38de0ed05	Fix failing unit tests (#558) I updated the tree serialization format for the new scikit-learn versions. I also updated the minimum requirement of scikit-learn to 1.3 to ensure compatibility. Fixes #555	2023-07-10 15:15:58 +02:00
Youhei Sakurai	55967a7324	Minimize if main section (#554) For migration from scripts to console_scripts in setup.py, the current long if __name__ == "__main__": section is a blocker because console_scripts requires a function to be specified as the entry point. Move the logic into a main() function.	2023-07-05 10:49:16 +01:00
Dai Sugimori	bf3b092ed4	Add BertJapaneseTokenizer support with bert_ja tokenization configuration (#534) See elasticsearch#95546	2023-06-23 08:14:27 +01:00
Benjamin Trent	8b327f60b8	[ML] add ability to upload xlm-roberta tokenized models (#518) This allows XLMRoberta models to be uploaded to Elasticsearch. blocked by: elastic/elasticsearch#94089	2023-06-14 07:59:28 -04:00
David Kyle	68a22a8001	Default the optional es_version parameter (#545)	2023-06-07 12:34:53 +01:00
David Kyle	32ab988eb6	Tolerate different model output formats when measuring embedding size (#535) Only add the embedding_size config option if the target Elasticsearch cluster version supports it	2023-05-25 12:25:31 -05:00
David Kyle	1e6f48f8f4	Generate valid NLP model id from file path (#541) The eland_import_hub_model script supports uploading a local file where the --hub-model-id argument is a file path. If the --es-model-id option is not used, the model ID is generated from the hub model ID, and when that is a file path, the path must be converted to a valid Elasticsearch model ID.	2023-05-22 15:37:36 +01:00
Seth Michael Larson	f7ea3bd476	Add a compatibility layer for Elasticsearch server 8.5.0 field_caps API	2023-05-02 15:40:20 -05:00
David Kyle	50d301f7cb	Set embedding_size config parameter for Text Embedding models (#532)	2023-04-25 11:41:14 +01:00
David Kyle	940f2a9bad	[NLP] Add support for the pass_through task #526	2023-04-06 15:43:00 +01:00
David Kyle	8e0d897171	[NLP] Prevent TypeError with None check (#525)	2023-04-03 14:56:19 +01:00
Seth Michael Larson	44e04b4905	Release v8.7.0	2023-03-30 14:00:02 -05:00
David Kyle	7f4687c791	[ML] Text expansion model config support (#520)	2023-03-08 15:40:14 +00:00
Benjamin Trent	d5578637cb	Choose text_embedding from auto when task type is unknown but it's a sentence-transformers model (#516) closes https://github.com/elastic/eland/issues/514	2023-02-09 12:50:30 -05:00
Valeriy Khakhutskyy	0576114a1d	[ML] Export ML model as sklearn Pipeline (#509) Closes #503 Note: I also had to fix the Sphinx version to 5.3.0 since, starting from 6.0, Sphinx suffers from a TypeError bug, which causes a CI failure.	2023-02-01 16:17:06 +01:00
Valeriy Khakhutskyy	2ea96322b3	Update to latest ES versions and fix unit tests (#512) Update the test matrix to the latest Elasticsearch versions and fix the broken unit tests on the CI.	2023-01-31 20:55:29 +01:00
David Kyle	c55516f376	Fixes for two type hinting issues	2023-01-04 09:53:09 -06:00
David Kyle	211cc2c83f	Handle OSError for missing LightGBM dependency Co-authored-by: Seth Michael Larson <seth.larson@elastic.co>	2022-11-02 11:32:27 -05:00
Benjamin Trent	a8c8726634	[ML] add text_similarity task support (#486) Adds text_similarity task support. This is a cross-encoder transformer task where both sequences are given to the transformer at once. According to 🤗 (or at least as far as the cross-encoder models are concerned) this is a sequence classification task with just one classification "label". But really, it isn't labeled at all and is more akin to a regression model. related: elastic/elasticsearch#88439	2022-08-01 09:04:34 -04:00
Seth Michael Larson	c97e69410d	Release v8.3.0	2022-07-11 13:14:13 -05:00
David Kyle	0eb36faa5b	Restrict PyTorch version not to be more advanced than that used in Elasticsearch (#479) Elasticsearch uses v1.11 of PyTorch. Models created with the latest PyTorch release (v1.12) are not compatible with v1.11. This pins the PyTorch version to 1.11 to prevent the incompatibility. The version of the Elasticsearch Python client is now required to be >= Eland. All users of Eland for importing NLP models should upgrade.	2022-07-07 14:56:42 +01:00
Benjamin Trent	8892f4fd64	[ML] adds new auto task type that attempts to automatically determine NLP task type from model config (#475) For many model types, we don't need to require the task requested. We can infer the task type based on the model configuration and architecture. This commit makes the `task-type` parameter optional for the model upload script and adds logic for auto-detecting the task type based on the 🤗 model.	2022-06-23 08:32:23 -04:00
David Kyle	081c8efaa0	Freeze the traced PyTorch model	2022-06-21 07:43:18 -05:00
Benjamin Trent	ec041ffdfd	[ML] ensure quantization is applied (#472)	2022-06-15 09:23:24 -04:00
Nigel Small	a4838f4d22	Ignore type checking for `agg_value`	2022-05-31 09:23:15 -05:00
Benjamin Trent	fa30246937	[ML] fixes decision tree classifier upload to account for probabilities (#465) This switches our sklearn.DecisionTreeClassifier serialization logic to account for multi-valued leaves in the tree. The key difference between our inference and DecisionTreeClassifier is that we run a softmax over the leaf where sklearn simply normalizes the results. This means that the "probabilities" we return will differ from sklearn's.	2022-05-17 08:11:20 -04:00
Seth Michael Larson	5bbb8e484a	Release 8.2.0	2022-05-11 06:38:21 -05:00
Benjamin Trent	650e02d16e	[ML] improve general pytorch model import and add tests (#463) This improves the user-facing functions and classes for PyTorch NLP model upload to Elasticsearch. Previously it was difficult to wrap your own module for uploading to Elasticsearch. This commit splits some classes out, adds new ones, and adds tests showing how to wrap some simple modules.	2022-05-05 10:50:53 -04:00
Benjamin Trent	70fadc9986	[ML] add support for question_answering NLP tasks (#457) Adds support for `question_answering` NLP models within the pytorch model uploader. Related: https://github.com/elastic/elasticsearch/pull/85958	2022-05-04 13:15:33 -04:00
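
Commit fa30246937 in the log above says that Elasticsearch inference runs a softmax over a decision tree leaf, while scikit-learn simply normalizes the leaf values. The sketch below illustrates why the two "probabilities" differ. It is not Eland or Elasticsearch code: the function names and the leaf values are made up for illustration, and the exact contents of a serialized leaf are an assumption.

```python
import math


def normalize(leaf_values):
    """Divide each value by the sum (what the commit says sklearn does)."""
    total = sum(leaf_values)
    return [v / total for v in leaf_values]


def softmax(leaf_values):
    """Exponentiate, then normalize (what the commit says Elasticsearch does)."""
    largest = max(leaf_values)  # subtract the max for numerical stability
    exps = [math.exp(v - largest) for v in leaf_values]
    total = sum(exps)
    return [e / total for e in exps]


# Hypothetical two-class leaf; real leaves may hold other values.
leaf = [0.75, 0.25]
normalized = normalize(leaf)
softmaxed = softmax(leaf)

print("normalized:", normalized)
print("softmax:   ", softmaxed)
```

For this leaf, normalization returns the values unchanged, while softmax gives roughly 0.62 and 0.38. The predicted class is the same in both cases because softmax preserves the ordering of the values, but the reported scores are not directly comparable.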

355 Commits
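
Commit 1e6f48f8f4 in the log above describes converting a file path into a valid Elasticsearch model ID when --es-model-id is not given. A minimal sketch of one way to do that follows. It is not Eland's implementation: the function name, the "__" replacement, and the 64-character limit are assumptions, as is the rule that an ID may contain only lowercase alphanumerics, hyphens, underscores, and periods and must start and end with an alphanumeric character.

```python
import re

MAX_ID_LENGTH = 64  # assumed upper bound on model ID length


def model_id_from_path(path: str) -> str:
    """Derive a candidate model ID from a file path (illustrative only)."""
    candidate = path.lower()
    # Replace each run of disallowed characters (path separators, spaces,
    # drive colons, ...) with a double underscore.
    candidate = re.sub(r"[^a-z0-9_.-]+", "__", candidate)
    # Keep the tail of a long path, since the model name usually comes last.
    candidate = candidate[-MAX_ID_LENGTH:]
    # Assumed rule: the ID must start and end with an alphanumeric character.
    candidate = candidate.strip("_.-")
    if not candidate:
        raise ValueError(f"cannot derive a model ID from {path!r}")
    return candidate


print(model_id_from_path("/models/My Model"))
print(model_id_from_path("C:\\Models\\E5-small"))
```

Truncating from the left rather than the right is a design choice in this sketch: with a long directory prefix, the end of the path is the part most likely to identify the model.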