279 Commits

Author SHA1 Message Date
Benjamin Trent
efb9e3b4c4
[ML] Add support for multi:softmax|softprob XGBClassifier 2020-08-06 12:04:10 -05:00
Seth Michael Larson
5c901e8f1b
Create pytest fixture for testing behavior of Eland vs Pandas 2020-07-28 16:47:22 -05:00
Seth Michael Larson
140623283a
Support Series/collections in Series.isin(), add type hints 2020-07-14 11:39:52 -05:00
Seth Michael Larson
6e6ad04c5c
Use 'script.context.field.max_compilations_rate' instead of deprecated setting 2020-07-14 09:51:35 -05:00
Seth Michael Larson
6c2f9a2ed2
Add DataFrame.size and Series.size 2020-07-13 17:30:14 -05:00
Seth Michael Larson
d50e06dda5
Add webinar recording link to notebook 2020-07-10 14:21:55 -05:00
Seth Michael Larson
ceacf759c3
Add long Apache-2.0 license header to all files 2020-07-08 15:10:43 -05:00
Seth Michael Larson
5897b4587c
Add webinar example notebook, update prose in docs 2020-07-08 14:44:40 -05:00
Seth Michael Larson
de9c836c5e
Error when MLModel.predict fails, add es_compress_model_definition 2020-07-08 14:31:27 -05:00
Léonard Binet
5d0df757cf
Add column names to DataFrame.__dir__ for better auto-completion support 2020-07-02 08:49:52 -05:00
Seth Michael Larson
f63941014f
Add support for es_if_exists='append' to pandas_to_eland() 2020-06-15 09:50:44 -05:00
Seth Michael Larson
ad2e012f1e
Release 7.7.0a1 7.7.0a1 2020-05-20 13:58:40 -05:00
Seth Michael Larson
eff9625be1
Update docs with all new APIs 2020-05-20 13:58:40 -05:00
Seth Michael Larson
6000ea73d0
Add [DataFrame, Series].filter() 2020-05-20 12:45:30 -05:00
Daniel Mesejo-León
890cf6dc97
Add Series.isna() and Series.notna() 2020-05-19 16:12:59 -05:00
Seth Michael Larson
1378544933
Normalize and prune top-level APIs 2020-05-18 14:55:41 -05:00
Seth Michael Larson
d1444f8e09
Add Conda Forge installation instructions 2020-05-15 15:27:41 -05:00
Seth Michael Larson
6ca41585e9
Upgrade to elasticsearch-py v7.7 2020-05-14 10:07:10 -05:00
Seth Michael Larson
d2047aa51a
Make ML libraries optional, fix type issues 2020-05-14 09:31:01 -05:00
Daniel Mesejo-León
bfd0ee6f90
Fix DataFrame.shape when smaller than its SizedTask 2020-05-06 13:59:47 -05:00
Daniel Mesejo-León
94dbb36081
Add .sample() method to DataFrame and Series 2020-05-04 12:07:21 -05:00
Seth Michael Larson
def3a46af9
Fix bug when combining AndFilter with OrFilter 2020-05-04 07:39:05 -05:00
Seth Michael Larson
fa8dbe0eb4
Restore documentation requirements 2020-04-29 13:57:51 -05:00
Seth Michael Larson
3d81def5cc
Add support for xgboost v1 2020-04-29 13:06:35 -05:00
Seth Michael Larson
df2a21ffd4
Make QueryParams a dataclass 2020-04-27 16:21:26 -05:00
Seth Michael Larson
15a1977dcf
Add agg compatibility logic to Field class 2020-04-27 15:16:48 -05:00
Seth Michael Larson
7946eb4daa
Add and enforce license headers 2020-04-25 16:26:58 -05:00
Seth Michael Larson
33b4976f9a
Add type hints to base modules 2020-04-24 12:39:13 -05:00
Daniel Mesejo-León
fe6589ae6a
Change ScriptFilter from inline to source for script caching 2020-04-21 07:41:56 -05:00
Daniel Mesejo-León
a779f04a6d
Add default dtype to empty pd.Series
Suppress pandas DeprecationWarning with default dtype on empty pd.Series
2020-04-19 08:51:10 -05:00
Stephen Dodson
1bc83d15e7
Change var/std aggs to use sample instead of population 2020-04-15 14:16:12 -05:00
Seth Michael Larson
e71420c883
Release 7.6.0a5 7.6.0a5 2020-04-14 11:07:32 -05:00
Stephen Dodson
50734f8bd9
Allow user to specify es data types in read_csv and pandas_to_eland (#181)
* Allow user to specify es data types in read_csv and pandas_to_eland

Also, some minor maintenance modifications:

- replaced pandas.util.testing with pandas.testing (required in 1.x)
- updated elasticsearch-py requirements to 7.6+ (to support ML code)

* linting file
2020-04-14 15:04:12 +00:00
Seth Michael Larson
e1cacead44
Add 'inference_config' on ES >=7.8 2020-04-14 07:51:50 -05:00
Seth Michael Larson
448770df78
Restrict public API, update license header 2020-04-14 07:31:23 -05:00
Daniel Mesejo-León
e8f307d2e0
Add NDFrame.median() aggregation 2020-04-13 08:48:39 -05:00
Daniel Mesejo-León
7a1c636e56
Add NDFrame.var() and .std() aggregations 2020-04-12 15:48:13 -05:00
Seth Michael Larson
064d43b9ef
Remove eland.Client, use Elasticsearch directly 2020-04-06 07:25:25 -05:00
Seth Michael Larson
29af76101e
Fix unpacking of median aggregation 2020-04-03 07:56:09 -05:00
Daniel Mesejo-León
023a35c3b4
Add instructions for how to build docs 2020-04-03 07:53:27 -05:00
Seth Michael Larson
c8bd25cbea
Add doctests to CI 2020-04-02 13:06:22 -05:00
Seth Michael Larson
7e5f0d3913
Add DataFrame.es_query() to query Elasticsearch directly 2020-04-02 13:06:22 -05:00
Seth Michael Larson
38251ddf08
No spaces in delimiters for serialized ML model 2020-04-02 07:40:51 -05:00
Stephen Dodson
71f2a3f793
Added 'use_pandas_index_for_es_ids' param to pandas_to_eland() 2020-03-31 09:20:47 -05:00
Daniel Mesejo-León
03582b9f5e
Import __version__ and other metadata by name 2020-03-30 07:45:04 -05:00
Seth Michael Larson
790e2b0de8
Update README with supported versions, pandas v1 outputs 2020-03-27 13:13:50 -05:00
Daniel Mesejo-León
e27a508c59
Update supported Pandas to v1.0 2020-03-27 12:21:15 -05:00
Seth Michael Larson
0c1d7222fe
Drop support for Python 3.5, add Black 2020-03-27 07:56:28 -05:00
Stephen Dodson
9e2997c00d
Bug/is scripted error (#149)
* Updating test matrix for 7.6 + removing oss for now.

* Resolving 7.6.0 docs issues

* Updating ML docs

* Minor mod to support 6.x style indices.

Currently, there is no specific test for this as
it requires a 6.x cluster. 6.x is not officially
supported by 7.x clients, but this is a friendly
option for users.

* Adding unittest for FieldMappings._extract_fields_from_mapping

* Changing to f-string formatting and adding exception test

* Reverting to OrderedDict

Will change after https://github.com/elastic/eland/pull/150 is merged.
2020-03-26 15:17:10 +00:00
Seth Michael Larson
2e74a56c0a
Release v7.6.0a4 7.6.0a4 2020-03-23 08:43:59 -05:00