Benjamin Trent
efb9e3b4c4
[ML] Add support for multi:softmax|softprob XGBClassifier
2020-08-06 12:04:10 -05:00
Seth Michael Larson
5c901e8f1b
Create pytest fixture for testing behavior of Eland vs Pandas
2020-07-28 16:47:22 -05:00
Seth Michael Larson
140623283a
Support Series/collections in Series.isin(), add type hints
2020-07-14 11:39:52 -05:00
Seth Michael Larson
6e6ad04c5c
Use 'script.context.field.max_compilations_rate' instead of deprecated setting
2020-07-14 09:51:35 -05:00
Seth Michael Larson
6c2f9a2ed2
Add DataFrame.size and Series.size
2020-07-13 17:30:14 -05:00
Seth Michael Larson
d50e06dda5
Add webinar recording link to notebook
2020-07-10 14:21:55 -05:00
Seth Michael Larson
ceacf759c3
Add long Apache-2.0 license header to all files
2020-07-08 15:10:43 -05:00
Seth Michael Larson
5897b4587c
Add webinar example notebook, update prose in docs
2020-07-08 14:44:40 -05:00
Seth Michael Larson
de9c836c5e
Error when MLModel.predict fails, add es_compress_model_definition
2020-07-08 14:31:27 -05:00
Léonard Binet
5d0df757cf
Add column names to DataFrame.__dir__ for better auto-completion support
2020-07-02 08:49:52 -05:00
Seth Michael Larson
f63941014f
Add support for es_if_exists='append' to pandas_to_eland()
2020-06-15 09:50:44 -05:00
Seth Michael Larson
ad2e012f1e
Release 7.7.0a1
7.7.0a1
2020-05-20 13:58:40 -05:00
Seth Michael Larson
eff9625be1
Update docs with all new APIs
2020-05-20 13:58:40 -05:00
Seth Michael Larson
6000ea73d0
Add [DataFrame, Series].filter()
2020-05-20 12:45:30 -05:00
Daniel Mesejo-León
890cf6dc97
Add Series.isna() and Series.notna()
2020-05-19 16:12:59 -05:00
Seth Michael Larson
1378544933
Normalize and prune top-level APIs
2020-05-18 14:55:41 -05:00
Seth Michael Larson
d1444f8e09
Add Conda Forge installation instructions
2020-05-15 15:27:41 -05:00
Seth Michael Larson
6ca41585e9
Upgrade to elasticsearch-py v7.7
2020-05-14 10:07:10 -05:00
Seth Michael Larson
d2047aa51a
Make ML libraries optional, fix type issues
2020-05-14 09:31:01 -05:00
Daniel Mesejo-León
bfd0ee6f90
Fix DataFrame.shape when smaller than its SizedTask
2020-05-06 13:59:47 -05:00
Daniel Mesejo-León
94dbb36081
Add .sample() method to DataFrame and Series
2020-05-04 12:07:21 -05:00
Seth Michael Larson
def3a46af9
Fix bug when combining AndFilter with OrFilter
2020-05-04 07:39:05 -05:00
Seth Michael Larson
fa8dbe0eb4
Restore documentation requirements
2020-04-29 13:57:51 -05:00
Seth Michael Larson
3d81def5cc
Add support for xgboost v1
2020-04-29 13:06:35 -05:00
Seth Michael Larson
df2a21ffd4
Make QueryParams a dataclass
2020-04-27 16:21:26 -05:00
Seth Michael Larson
15a1977dcf
Add agg compatibility logic to Field class
2020-04-27 15:16:48 -05:00
Seth Michael Larson
7946eb4daa
Add and enforce license headers
2020-04-25 16:26:58 -05:00
Seth Michael Larson
33b4976f9a
Add type hints to base modules
2020-04-24 12:39:13 -05:00
Daniel Mesejo-León
fe6589ae6a
Change ScriptFilter from inline to source for script caching
2020-04-21 07:41:56 -05:00
Daniel Mesejo-León
a779f04a6d
Add default dtype to empty pd.Series
Suppress pandas DeprecationWarning with default dtype on empty pd.Series
2020-04-19 08:51:10 -05:00
Stephen Dodson
1bc83d15e7
Change var/std aggs to use sample instead of population
2020-04-15 14:16:12 -05:00
Seth Michael Larson
e71420c883
Release 7.6.0a5
7.6.0a5
2020-04-14 11:07:32 -05:00
Stephen Dodson
50734f8bd9
Allow user to specify es data types in read_csv and pandas_to_eland (#181)
* Allow user to specify es data types in read_csv and pandas_to_eland
Also, some minor maintenance modifications:
- replaced pandas.util.testing with pandas.testing (required in 1.x)
- updated elasticsearch-py requirements to 7.6+ (to support ML code)
* linting file
2020-04-14 15:04:12 +00:00
Seth Michael Larson
e1cacead44
Add 'inference_config' on ES >=7.8
2020-04-14 07:51:50 -05:00
Seth Michael Larson
448770df78
Restrict public API, update license header
2020-04-14 07:31:23 -05:00
Daniel Mesejo-León
e8f307d2e0
Add NDFrame.median() aggregation
2020-04-13 08:48:39 -05:00
Daniel Mesejo-León
7a1c636e56
Add NDFrame.var() and .std() aggregations
2020-04-12 15:48:13 -05:00
Seth Michael Larson
064d43b9ef
Remove eland.Client, use Elasticsearch directly
2020-04-06 07:25:25 -05:00
Seth Michael Larson
29af76101e
Fix unpacking of median aggregation
2020-04-03 07:56:09 -05:00
Daniel Mesejo-León
023a35c3b4
Add instructions for how to build docs
2020-04-03 07:53:27 -05:00
Seth Michael Larson
c8bd25cbea
Add doctests to CI
2020-04-02 13:06:22 -05:00
Seth Michael Larson
7e5f0d3913
Add DataFrame.es_query() to query Elasticsearch directly
2020-04-02 13:06:22 -05:00
Seth Michael Larson
38251ddf08
No spaces in delimiters for serialized ML model
2020-04-02 07:40:51 -05:00
Stephen Dodson
71f2a3f793
Added 'use_pandas_index_for_es_ids' param to pandas_to_eland()
2020-03-31 09:20:47 -05:00
Daniel Mesejo-León
03582b9f5e
Import __version__ and other metadata by name
2020-03-30 07:45:04 -05:00
Seth Michael Larson
790e2b0de8
Update README with supported versions, pandas v1 outputs
2020-03-27 13:13:50 -05:00
Daniel Mesejo-León
e27a508c59
Update supported Pandas to v1.0
2020-03-27 12:21:15 -05:00
Seth Michael Larson
0c1d7222fe
Drop support for Python 3.5, add Black
2020-03-27 07:56:28 -05:00
Stephen Dodson
9e2997c00d
Bug/is scripted error (#149)
* Updating test matrix for 7.6 + removing oss for now.
* Resolving 7.6.0 docs issues
* Updating ML docs
* Minor mod to support 6.x style indices.
Currently, there is no specific test for this as
it requires a 6.x cluster. 6.x is not officially
supported by 7.x clients, but this is a friendly
option for users.
* Adding unittest for FieldMappings._extract_fields_from_mapping
* Changing to f-string formatting and adding exception test
* Reverting to OrderedDict
Will change after https://github.com/elastic/eland/pull/150 is merged.
2020-03-26 15:17:10 +00:00
Seth Michael Larson
2e74a56c0a
Release v7.6.0a4
7.6.0a4
2020-03-23 08:43:59 -05:00