István Zoltán Szabó
5a3e7d78b3
[DOCS] Completes the list of available NLP task types. ( #699 )
2024-06-10 12:30:07 +02:00
Bart Broere
1014ecdb39
Fix non _source fields missing from the result hits ( #693 )
2024-06-10 11:09:52 +04:00
David Kyle
632074c0f0
Make eland_import_hub_model script compatible with serverless ( #698 )
...
Checks for build_flavor == serverless rather than a version
2024-06-07 14:46:12 +01:00
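A minimal sketch of the check described in the commit above, assuming the cluster's root info response has already been fetched as a dictionary. The helper name `is_serverless` is hypothetical and is not taken from the Eland source; only the `build_flavor == serverless` comparison comes from the commit message.

```python
from typing import Any, Mapping


def is_serverless(info: Mapping[str, Any]) -> bool:
    """Return True when the cluster info reports the serverless build flavor.

    Hypothetical helper: `info` stands in for the response of the root
    info endpoint, whose "version" object carries a "build_flavor" field.
    """
    version = info.get("version", {})
    # Compare the flavor instead of parsing a version number.
    return version.get("build_flavor") == "serverless"
```

Comparing the flavor avoids relying on the version number to tell a serverless project apart from a regular cluster.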
Bart Broere
35a96ab3f0
Fix missing method str.removeprefix in Python 3.8 ( #695 )
2024-05-24 10:25:04 +04:00
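`str.removeprefix` only exists from Python 3.9 onwards, so code that must run on 3.8 needs an equivalent. The sketch below shows one common way to write it; the function name `remove_prefix` is an assumption for illustration and may differ from what the fix actually introduced.

```python
def remove_prefix(text: str, prefix: str) -> str:
    """Return `text` without a leading `prefix`, like str.removeprefix (3.9+)."""
    # Only strip when the prefix is non-empty and actually present.
    if prefix and text.startswith(prefix):
        return text[len(prefix):]
    return text
```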
Quentin Pradet
116416b3e8
Stop duplicating requirements ( #691 )
2024-05-14 15:59:39 +04:00
Ashok Kumar
5b728c29c1
Replace check for Elasticsearch to str/list in ensure_es_client ( #690 )
2024-05-04 09:01:31 +04:00
Quentin Pradet
e76b32eee2
Release 8.13.1
v8.13.1
2024-05-03 09:20:45 +04:00
Quentin Pradet
fd38e26df1
Support HTTP proxies in eland_import_hub_model ( #688 )
...
* Document TLS/SSL options for import script
* Mention --help option
* Add HTTP proxy support
* Mention HTTP_PROXY too
---------
Co-authored-by: David Kyle <david.kyle@elastic.co>
2024-05-02 21:03:44 +04:00
Quentin Pradet
f7f6e0aba9
Document TLS/SSL options for import script ( #667 )
2024-05-02 18:06:40 +04:00
Aurélien FOUCRET
9cea2385e6
Work around LTR model cache in tests ( #685 )
2024-04-08 14:00:36 +04:00
Quentin Pradet
1921792df8
Release 8.13.0
v8.13.0
2024-03-27 18:18:21 +04:00
David Kyle
c16e36c051
Add Python 3.11 to support matrix ( #681 )
2024-03-27 10:34:35 +00:00
David Kyle
ae0bba34c6
Upgrade torch to 2.1.2 ( #671 )
...
Compatible with Elasticsearch 8.13 where the same upgrade has been made
2024-03-26 10:06:50 +00:00
Iulia Feroli
aaec995b1b
Update overview.asciidoc to replace tuple reference to API Key ( #678 )
2024-03-21 15:31:19 +04:00
Iulia Feroli
de83f3f905
Improve PyTorch installation instructions ( #677 )
2024-03-21 14:21:32 +04:00
David Kyle
8e8c49ddbf
Mute the Learning to Rank tests ( #676 )
2024-03-21 10:13:31 +00:00
David Kyle
5d34dc3cc4
Add override option to specify the model's max input size ( #674 )
...
If the max input size cannot be found in the configuration, the user
can specify it as a parameter to the eland_import_hub_model script.
2024-03-20 10:02:43 +00:00
Bart Broere
9b335315bb
Mirror pandas' to_csv lineterminator instead of line_terminator ( #595 )
...
* Mirror pandas' to_csv lineterminator instead of line_terminator
(even though it looks a little weird perhaps)
* Remove squeeze argument
* Revert "Merge branch 'remove-squeeze-argument' into patch-2"
This reverts commit 8b9ab5647e244d78ec3471b80ee7c42e019cf347.
* Don't remove the parameter yet since people might use it
* Add pending deprecation warning
---------
Co-authored-by: David Kyle <david.kyle@elastic.co>
2024-02-23 14:23:58 +04:00
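The pattern described in the commit above (accept the new `lineterminator` spelling that pandas uses, keep the old `line_terminator` working, and emit a pending deprecation warning) can be sketched as follows. The function `rows_to_csv` is a hypothetical stand-in, not Eland's `to_csv`, and the warning text is illustrative only.

```python
import warnings
from typing import Iterable, Optional, Sequence


def rows_to_csv(
    rows: Iterable[Sequence[object]],
    lineterminator: Optional[str] = None,
    line_terminator: Optional[str] = None,  # old spelling, kept for callers
) -> str:
    """Join rows into CSV text, accepting both spellings of the terminator."""
    if line_terminator is not None:
        # Warn, but keep honouring the old parameter for now.
        warnings.warn(
            "line_terminator is pending deprecation; use lineterminator",
            PendingDeprecationWarning,
            stacklevel=2,
        )
        if lineterminator is None:
            lineterminator = line_terminator
    if lineterminator is None:
        lineterminator = "\n"
    return "".join(
        ",".join(str(value) for value in row) + lineterminator for row in rows
    )
```

When both spellings are passed, this sketch lets the new one win; that precedence is a design choice made here, not something stated in the commit.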
Quentin Pradet
28eda95ba9
Add feedback request to README ( #665 )
2024-02-15 15:23:45 +04:00
Quentin Pradet
f4b30753ad
Fix CI badge in README ( #664 )
2024-02-15 15:14:16 +04:00
Bart Broere
33cf029efe
Implement eland.DataFrame.to_json ( #661 )
...
Co-authored-by: Quentin Pradet <quentin.pradet@elastic.co>
2024-02-15 11:32:54 +04:00
Aurélien FOUCRET
9d492b03aa
Release 8.12.1
...
Co-authored-by: Quentin Pradet <quentin.pradet@elastic.co>
v8.12.1
2024-02-01 10:50:18 +04:00
Quentin Pradet
fd2ceab846
Run Buildkite docs jobs in pull requests from forks ( #652 )
2024-01-31 20:55:19 +04:00
Quentin Pradet
02190e74e7
Switch to 2024 black style ( #657 )
2024-01-31 14:47:19 +04:00
Aurélien FOUCRET
2a6a4b1f06
Fix missing value support for XGBRanker. ( #654 )
...
* Fix missing value support for XGBRanker.
* lint
* Sort expected scores
* lint
2024-01-23 18:42:24 +01:00
Quentin Pradet
1190364abb
Release 8.12.0
v8.12.0
2024-01-19 12:42:45 +04:00
David Kyle
64216d44fb
Add prefix_string config option to the import model hub script ( #642 )
2024-01-19 12:06:57 +04:00
Liam Thompson
0a6e3db157
[DOCS] Make online retail notebook runnable in Colab ( #641 )
...
* Make online retail notebook runnable in Colab
* Fix broken query
2024-01-18 15:55:20 +04:00
Aurélien FOUCRET
5169cc926a
Improve LTR ( #651 )
...
* Ensure the feature logger is using NaN for non matching query feature extractors (consistent with ES).
* Default score is None instead of 0.
* LTR model import API improvements.
* Fix feature logger tests.
* Fix export in eland.ml.ltr
* Apply suggestions from code review
Co-authored-by: Adam Demjen <demjened@gmail.com>
* Fix supported models for LTR
---------
Co-authored-by: Adam Demjen <demjened@gmail.com>
2024-01-17 13:01:47 +04:00
Aurélien FOUCRET
d2291889f8
Fix typo ( #650 )
2024-01-12 09:34:09 -05:00
Aurélien FOUCRET
d3ed669a5e
LTR feature logger ( #648 )
2024-01-12 13:52:04 +01:00
Adam Demjen
926f0b9b5c
Add XGBRanker and transformer ( #649 )
...
* Add XGBRanker and transformer
* Map XGBoostRegressorTransformer to XGBRanker
* Add unit tests
* Remove unused import
* Revert addition of type
* Update function comment
* Distinguish objective based on model class
2024-01-11 15:48:13 -05:00
Adam Demjen
840871f9d9
Accept LTR inference config when creating model ( #645 )
...
* Support for supplying inference_config
* Fix linting errors
* Add unit test
* Add LTR type, throw exception on predict, refine test
* Add search step to LTR test
* Fix linter errors
* Update rescoring assertion in test + type defs
* Fix linting error
* Remove failing assertion
2024-01-08 09:19:03 -05:00
Aurélien FOUCRET
05c5859b8a
Adding a new movie dataset to the tests. ( #646 )
2024-01-04 16:14:56 +01:00
Aurélien FOUCRET
0f91224daf
Add 8.12 to CI and remove 8.10 ( #647 )
2024-01-04 10:06:19 -05:00
Bart Broere
927acc86ad
Small cosmetic fix to the docs ( #640 )
2023-11-30 08:34:59 +01:00
David Kyle
6ef418f465
Release 8.11.1
v8.11.1
2023-11-22 11:55:53 +01:00
David Kyle
081250cdec
Fix failed import of ST RoBERTa models ( #637 )
...
Fixes an error uploading the sentence-transformers/all-distilroberta-v1 model,
which failed with "missing 2 required positional arguments: 'token_type_ids'
and 'position_ids'". The cause was that the tokenizer type was not recognised
due to a typo.
2023-11-21 12:53:43 +00:00
Quentin Pradet
af26897313
Bump numpy and shap ( #636 )
2023-11-21 13:17:53 +01:00
David Kyle
add61a69ec
Update CI machine types to N2 ( #634 )
...
Use `n2-standard-2` for lint and doc builds
Use `n2-standard-4` for tests
2023-11-21 11:33:04 +00:00
David Kyle
b689759278
Skip model config tests ( #635 )
...
For #633
2023-11-21 11:07:55 +00:00
Liam Thompson
87d18bd850
Fix colab link ( #632 )
...
Co-authored-by: Quentin Pradet <quentin.pradet@elastic.co>
2023-11-16 10:24:06 +00:00
Quentin Pradet
dfc522eb31
Allow es-doc members to trigger CI ( #631 )
2023-11-13 11:55:39 +01:00
Liam Thompson
508de981ff
Make demo notebook runnable in Colab ( #630 )
...
* Make demo notebook runnable in Colab
* Index using IDs starting from 0
* Trivial change to trigger CI
2023-11-10 08:44:19 +01:00
Quentin Pradet
41db37246f
Release 8.11.0
v8.11.0
2023-11-08 11:51:14 +01:00
Valeriy Khakhutskyy
6cecb454e3
[ML] Better memory estimation for NLP models ( #568 )
...
This PR adds the ability to estimate the per-deployment and per-allocation memory usage of NLP transformer models. It uses torch.profiler and logs the peak memory usage during inference.
This information is then used in Elasticsearch to provision models with sufficient memory (elastic/elasticsearch#98874).
2023-11-06 12:18:20 +01:00
Bart Broere
28e6d92430
Stream writes in to_csv()
...
Co-authored-by: P. Sai Vinay <pvinay1998@gmail.com>
2023-11-06 11:39:31 +01:00
Quentin Pradet
adf0535608
Fix docs build
...
Some dependencies like numpy are pinned to versions that do not support
Python 3.12. Python 3.10 is the latest version supported by Eland.
2023-11-06 13:25:30 +04:00
Bart Broere
5e5f36bdf8
Deal with the mad aggregation being removed in Pandas 2 ( #602 )
2023-11-06 06:12:16 +01:00
David Kyle
5b3a83e7f2
[NLP] Support E5 small multi-lingual ( #625 )
...
Although E5 small is a BERT-based model, its forward method takes 2 parameters,
not 4. Use the tokenizer type to decide the number of parameters.
2023-10-31 17:49:43 +00:00
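The idea in the commit above, choosing the forward inputs from the tokenizer type rather than from the model architecture, might be sketched like this. The set `TWO_INPUT_TOKENIZERS` and the function `forward_input_names` are hypothetical; the log does not say which tokenizer types Eland treats as two-input. The names of the two extra inputs are taken from the error message quoted in #637.

```python
from typing import List

# Assumed membership, for illustration only.
TWO_INPUT_TOKENIZERS = {"XLMRobertaTokenizer", "RobertaTokenizer"}


def forward_input_names(tokenizer_type: str) -> List[str]:
    """Pick the model's forward inputs based on the tokenizer type name."""
    names = ["input_ids", "attention_mask"]
    if tokenizer_type in TWO_INPUT_TOKENIZERS:
        # Two-parameter forward, even if the model itself is BERT-based.
        return names
    # Four-parameter forward used for the remaining tokenizer types.
    return names + ["token_type_ids", "position_ids"]
```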