167 Commits

Author SHA1 Message Date
Stephen Dodson
e755a2e160 Minor doc fix for Series.to_string 2019-11-22 16:29:51 +00:00
Stephen Dodson
91c811345c Minor updates to docs and doctests 2019-11-22 16:22:16 +00:00
Stephen Dodson
84e23ab5d1 Added Series metric aggs + Series docs
Also, improved Series.to_string()
2019-11-22 15:44:55 +00:00
Stephen Dodson
5d119215f8 Fixing rename and truediv issues
tests pass
TODO - implement additional orithmetic ops
2019-11-21 20:37:54 +00:00
Stephen Dodson
c12bf9357b Series rename and arithmetic initial implementation
Partially implemented, tests fail with this commit.
2019-11-21 15:39:13 +00:00
Stephen Dodson
6564f26245 Adding 'development' section to docs
Adding contributing section based on Elasticsearch/CONTRIBUTING.md
TODO - add testing docs (based on CI)1
2019-11-20 10:32:35 +00:00
stevedodson
2a409962ea
Merge pull request #55 from blaklaybul/fix-boolean-index
Instantiates Column as Series with Specified dtype
2019-11-20 08:14:59 +00:00
Michael Hirsch
f1ec6c0d8b fixes UnboundLocalError when displaying empty dataframes 2019-11-19 15:52:03 -05:00
Michael Hirsch
c90602dd65 sets max_rows=1 in case of empty dataframe 2019-11-19 15:13:18 -05:00
Michael Hirsch
9c03d5a0d4 instantiates column as series with specified dtype 2019-11-19 13:13:08 -05:00
Michael Hirsch
9c9ca90c0d
Adds Support for Series.value_counts() (#49)
* adds support for series.value_counts

* adds docs for series.value_counts

* adds tests for series.value_counts

* updates keyerror language

* adds es docs as an external source

* adds parameters for metrics and terms aggs

* adds 2 tests to check for exceptions

* explains the size parameter

* removes print statements from tests

* checks that es_size is a positive integer

* implements assert_series_equal
2019-11-19 11:27:15 -05:00
stevedodson
885a0a4aba
Merge pull request #51 from stevedodson/master
Updating docs + added supported methods doc
2019-11-19 14:09:13 +00:00
Stephen Dodson
8f31c04c07 Extending metrics tests based on comments on https://github.com/elastic/eland/pull/51 2019-11-19 13:49:34 +00:00
Josh Devins
463edf9f90
Fixes field caps query params exceeding HTTP limit
We want all fields from _field_caps so we just use the wildcard expression here.
It's not clear why it didn't work before but it does now. This alleviates the
need to pass all fields to _field_caps.

See: https://github.com/elastic/eland/issues/52
2019-11-19 14:08:57 +01:00
Stephen Dodson
2854eecbf4 https://github.com/elastic/eland/issues/50 fix.
Root cause was incorrect filtering of numeric values by columns.
Added more tests + fixed a doctest.
2019-11-19 12:01:53 +00:00
Stephen Dodson
9b4fe40305 Updating docs + added supported methods doc 2019-11-19 10:42:23 +00:00
Stephen Dodson
fb2a1fae7b Updated to_string/to_html docs 2019-11-18 15:27:43 +00:00
Stephen Dodson
327f43d912 Fixing issue in to_html/to_string if max_rows is set 2019-11-18 14:47:35 +00:00
Stephen Dodson
d92ed94ef0 Improve to_string/to_html/__repr__/_repr_html_ tests
Added more rigorious tests for string representation
and fixing issue with to_html.
2019-11-18 12:55:23 +00:00
stevedodson
c93d07981d
Merge pull request #47 from blaklaybul/trim-display
Tabular Display: show 10 rows if index is larger than max_rows
2019-11-18 10:35:03 +01:00
Michael Hirsch
4eea073305 includes assertion 2019-11-16 11:57:25 -05:00
Michael Hirsch
8b96f1fa2c removes errant space 2019-11-15 17:41:15 -05:00
Michael Hirsch
30d307bdaf implements min rows to truncate display for large results 2019-11-15 17:38:46 -05:00
Michael Hirsch
b0be68e1db tabular display: show 10 rows if index is larger than max_rows 2019-11-15 11:10:35 -05:00
Stephen Dodson
2f4d601932 Adding eland.read_csv
TODO - resolve issue with ordering of eland.DataFrame compared to csv
2019-11-15 15:14:12 +00:00
Stephen Dodson
f5025b9f39 Renamed ed_to_pd eland_to_pandas and added docs.
+ added some additions to .gitignore
+ removed DataFrame.squeeze for now
2019-11-15 11:21:27 +00:00
Stephen Dodson
fb2774e7c7 Adding Series.isin 2019-11-14 20:25:34 +00:00
Stephen Dodson
5a546577f4 Resolving DataFrame.query issues + more docs 2019-11-14 20:04:38 +00:00
Stephen Dodson
8b3c66a5c8 Changing LICENSE to Apache2
Changing LICENSE to Apache2 + cleaning up setup.py
2019-11-14 09:27:44 +00:00
Stephen Dodson
dff49d01fe More doc updates. 2019-11-13 18:23:43 +00:00
Stephen Dodson
e181476dfe First effort at tidying up docs. Still work-in-progress. 2019-11-12 20:26:59 +00:00
Stephen Dodson
8de7a1db7d Resolved minor PyCharm issues 2019-11-05 13:31:10 +00:00
Stephen Dodson
c1ee409a33 Major cleanup - removed modin as dependency
modin removed as a dependency and iloc feature
removed for now - TODO add back in.
2019-11-04 13:13:42 +00:00
Stephen Dodson
9dad8613d3 Fixing tests, and upgrading to pandas 0.25.1 2019-10-18 08:06:07 +00:00
Stephen Dodson
315d4c3287 Resolving some issues with import dependencies 2019-10-08 14:39:24 +00:00
Stephen Dodson
337bef1c5d Demo day notebook + minor updates added 2019-08-15 12:26:58 +00:00
Stephen Dodson
ef289bfe78 Adding partial DataFrame.query support
Only > and == currently implemented for PoC. 'query'
language not supported yet.
2019-08-14 14:44:04 +00:00
Stephen Dodson
49bad292d3 Added DataFrame.to_csv - tests still failing 2019-08-09 07:54:44 +00:00
Stephen Dodson
c6e0c5b92b Adding smaller test and first effort to implement aggs 2019-08-06 14:58:38 +00:00
Stephen Dodson
67b7aee9c9 Adding DataFrame.hist tests and DataFrame.select_dtypes 2019-08-01 12:55:17 +00:00
Stephen Dodson
3435ffac1b Adding first implementation of eland.DataFrame.hist 2019-07-31 09:59:52 +00:00
Stephen Dodson
1fa4d3fbe7 Partial implementation of hist - does not work
Backup push
2019-07-12 15:24:32 +00:00
Stephen Dodson
9bf3505b7e Cleanup + removed dependence of elasticsearch-dsl 2019-07-11 10:17:04 +00:00
Stephen Dodson
ee9b621891 Removing redundant test 2019-07-11 10:13:00 +00:00
Stephen Dodson
d71ce9f50c Adding drop + the ability for operations to have a query
Significant refactor - needs cleanup
2019-07-11 10:11:57 +00:00
Stephen Dodson
dc07285aa1 Merge branch 'master' of github.com:stevedodson/eland 2019-07-09 10:03:08 +00:00
Stephen Dodson
a73c999290 iloc is (mainly) working. 2019-07-09 10:02:08 +00:00
stevedodson
e87bfea709
Update dataframe_supported.rst 2019-07-05 07:15:33 +00:00
Stephen Dodson
245def48e9 Added docs/dataframe_supported.rst 2019-07-04 19:38:26 +00:00
Stephen Dodson
d0ea715c31 Added test data and additional test cases 2019-07-04 19:25:47 +00:00