559 Commits

Author SHA1 Message Date
Jabin Kong
77f9a455e9
Fix docstring formatting 2021-09-07 11:40:19 -05:00
P. Sai Vinay
315f94b201
Add excluded lines for coverage and improve coverage 2021-09-07 11:39:19 -05:00
Seth Michael Larson
a50c3657c4
Release v7.14.1b1 v7.14.1b1 2021-08-30 13:42:55 -05:00
Seth Michael Larson
7a2e845a76
Speed up CI by only installing Nox in Dockerfile 2021-08-20 08:39:02 -05:00
Jabin Kong
1aa193da9e
Add iterrows() and itertuples() to DataFrame
Co-authored-by: Seth Michael Larson <seth.larson@elastic.co>
2021-08-20 08:34:52 -05:00
Seth Michael Larson
e4f88a34a6
Yield list of hits from _search_yield_hits() instead of individual hits 2021-08-17 12:16:10 -05:00
P. Sai Vinay
011bf29816
Simplify ES->pandas logic by removing Collectors 2021-08-16 12:22:02 -05:00
Seth Michael Larson
76d83ea47f
Bump version to 7.14.0b1 v7.14.0b1 2021-08-09 09:21:49 -05:00
Seth Michael Larson
b0c8434c06
Release 7.14.0b1 2021-08-09 09:11:57 -05:00
Seth Michael Larson
15ba8d3e02
Fall back to scroll searches for Elasticsearch <7.12
PIT+search_after became universally safe in Elasticsearch 7.12, which automatically adds a sort tiebreaker field called `_shard_doc` when using PITs. We now need feature detection so that Elasticsearch <7.12 clusters keep using the previous scroll method.
2021-08-08 12:19:41 -05:00
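The feature detection described in this commit message can be sketched roughly as follows. This is a minimal illustration and not the code from the commit: the helper names (`parse_version`, `choose_search_strategy`, `build_pit_page_body`) and the `PIT_MIN_VERSION` constant are hypothetical, and the explicit `_shard_doc` sort is written out only to make the tiebreaker visible, since the message says Elasticsearch 7.12+ adds it automatically.

```python
from typing import Any, Dict, List, Optional, Tuple

# Assumption for this sketch: PIT + search_after is treated as safe from 7.12.0,
# as stated in the commit message above.
PIT_MIN_VERSION = (7, 12, 0)


def parse_version(version: str) -> Tuple[int, ...]:
    """Parse '7.11.2', '7.12' or '7.14.0-SNAPSHOT' into a 3-tuple of ints."""
    core = version.split("-", 1)[0]  # drop suffixes such as '-SNAPSHOT'
    numbers = [int(part) for part in core.split(".")[:3]]
    numbers += [0] * (3 - len(numbers))  # pad '7.12' to (7, 12, 0)
    return tuple(numbers)


def choose_search_strategy(es_version: str) -> str:
    """Return 'pit_search_after' on 7.12+ clusters, otherwise 'scroll'."""
    if parse_version(es_version) >= PIT_MIN_VERSION:
        return "pit_search_after"
    return "scroll"


def build_pit_page_body(
    pit_id: str,
    page_size: int,
    search_after: Optional[List[Any]] = None,
    keep_alive: str = "1m",
) -> Dict[str, Any]:
    """Build the request body for one page of a PIT + search_after search."""
    body: Dict[str, Any] = {
        "size": page_size,
        "pit": {"id": pit_id, "keep_alive": keep_alive},
        # Explicit tiebreaker sort, shown here for clarity only.
        "sort": [{"_shard_doc": "asc"}],
    }
    if search_after is not None:
        # Sort values of the last hit from the previous page.
        body["search_after"] = search_after
    return body
```

In practice the version string would come from the cluster itself (for example the `version.number` field of the info response), and the `scroll` branch would keep using the older scroll-based paging.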
P. Sai Vinay
30876c8899
Switch to Point-in-Time with search_after instead of using scroll APIs
Co-authored-by: Seth Michael Larson <seth.larson@elastic.co>
2021-08-07 16:05:33 -05:00
P. Sai Vinay
8f84a315be
Add test case for pseudohubererror for XGBoost 2021-08-06 15:59:48 -05:00
P. Sai Vinay
d3f8d7b8f6
Optimize FieldMappings.aggregate_field_name() method 2021-08-06 11:27:59 -05:00
Seth Michael Larson
54b497ed9a
Update supported versions of Python, pandas, and Elasticsearch 2021-08-04 13:21:17 -05:00
P. Sai Vinay
823f01cc6c
Add type hints to 'eland.operations' and 'eland.ndframe' 2021-08-02 11:50:35 -05:00
P. Sai Vinay
c0e861dc77
Fix installed pandas version on Jenkins 2021-07-31 12:51:11 -05:00
P. Sai Vinay
4c1af42c14
Add idxmax and idxmin methods to DataFrame 2021-07-28 07:55:26 -05:00
Seth Michael Larson
c74fccbd74
Drop support for Python 3.6, pandas<1.2 2021-07-27 14:43:03 -05:00
P. Sai Vinay
193bcb73ef
Add support for Pandas v1.3 and LightGBM v3.x 2021-07-27 11:01:35 -05:00
P. Sai Vinay
22475cdc46
Add PANDAS_VERSION to Jenkins matrix 2021-07-26 11:17:46 -05:00
Seth Michael Larson
1555ea9534
Fix typo in version number
Should be `7.13.0b1` instead of `7.13.1b1`
v7.13.0b1
2021-06-22 12:03:46 -05:00
Seth Michael Larson
16178dfb5d
Release 7.13.0b1 2021-06-22 11:59:27 -05:00
P. Sai Vinay
ac2efb5863
Optimize df.describe() to use aggregations instead of its own query 2021-06-22 11:29:54 -05:00
P. Sai Vinay
5fe32a24df
Add quantile() to DataFrameGroupBy 2021-06-22 10:54:33 -05:00
P. Sai Vinay
7e8520a8ef
Remove deprecated code in XGBoost and test suite 2021-06-08 15:19:56 -05:00
P. Sai Vinay
e9c0b897f5
Add quantile() to DataFrame and Series 2021-06-08 13:02:44 -05:00
P. Sai Vinay
aa9d60e7e7
Add sort order to groupby dropna=False (#322)
* Add sort order to groupby dropna=False

* Fix rebase
2021-04-21 13:24:52 +00:00
Stephen Dodson
1040160451
Fix bugs with field mapping and lint issue (#346)
* Fix bugs with field mapping:

1. If no permission to call _mapping, return readable error
2. If index is wildcard, fix issues with user warnings

* Fixing lint issues

* Removing trailing backslashes in doc

* Remove pandas/matplotlib deprecation warning

This warning is due to a conflict between
pandas/matplotlib that may be resolved in a later
version. For now, ignore the warning so CI works.
2021-03-30 14:49:54 +00:00
Seth Michael Larson
985afe74e0
Release 7.10.1b1 7.10.1b1 2021-01-12 12:36:23 -06:00
Seth Michael Larson
26354622b5
Add more sections for elastic.co/guide 2021-01-12 10:26:01 -06:00
P. Sai Vinay
421d84fd20
Add mode() method to DataFrame and Series 2021-01-07 12:17:10 -06:00
P. Sai Vinay
27717eead1
Remove deprecated options and aliases 2021-01-04 13:20:45 -06:00
P. Sai Vinay
f89d79b1b4
Fix py.typed include in MANIFEST.in
Co-authored-by: Seth Michael Larson <seth.larson@elastic.co>
2020-12-30 15:07:40 -06:00
Seth Michael Larson
a552504f9b
Add support for Pandas 1.2.0 2020-12-30 14:20:36 -06:00
P. Sai Vinay
473db4576b
Move tests directory outside of eland namespace 2020-11-16 11:30:41 -06:00
P. Sai Vinay
56f6ba6c8b
Add Elasticsearch storage usage to df.info() 2020-11-16 10:07:28 -06:00
P. Sai Vinay
789f8959bc
Add support for pd.set_option("display.max_rows", None) 2020-11-06 12:23:09 -06:00
P. Sai Vinay
75451f1e93
Add pytest-cov for coverage tracking 2020-11-06 11:34:15 -06:00
P. Sai Vinay
4e92e3cf62
Fix Eland logo and update contributing documentation 2020-11-06 09:33:30 -06:00
Seth Michael Larson
31760fe02c
Release 7.10.0b1 7.10.0b1 2020-10-29 13:43:34 -05:00
Seth Michael Larson
b936e98012
Allow dict in es_type_overrides, text fields by default get keyword sub-field 2020-10-29 13:16:42 -05:00
Seth Michael Larson
cb4cd083c3
Add support for es_match() to DataFrame and Series 2020-10-29 10:16:50 -05:00
Seth Michael Larson
92a8040614
Test against Elasticsearch 7.10 2020-10-28 09:03:46 -05:00
Seth Michael Larson
ae96558075
Add source for 'elastic.co/guide' to 'docs/guide' 2020-10-28 07:57:10 -05:00
Seth Michael Larson
95b8d75e37
Fix 'Series.__repr__()' when the series is empty 2020-10-27 17:08:37 -05:00
P. Sai Vinay
54468cb85b
Add pytest --nbval of notebook examples to CI 2020-10-27 15:15:04 -05:00
P. Sai Vinay
e17b4e03ea
Error when es_type_overrides receives unknown columns 2020-10-27 13:48:31 -05:00
Seth Michael Larson
28951c0ad1
Add linting+docs to GitHub Actions, fix docs 2020-10-27 11:28:55 -05:00
Seth Michael Larson
ae70f03df3
Document DataFrame.groupby() methods 2020-10-27 10:10:57 -05:00
P. Sai Vinay
475e0f41ef
Implement DataFrameGroupBy.count() 2020-10-23 08:41:50 -05:00