26 Commits

Author SHA1 Message Date
stevedodson
f06219f0ec
Feature/refactor tasks (#83)
* Significant refactor of task list in operations.py

Classes based on composite pattern replace tuples for
tasks.

* Addressing review comments for eland/operations.py

* Minor update to review fixes

* Minor fix for some better handling of non-aggregatable fields: https://github.com/elastic/eland/issues/71

* Test for non-aggrgatable value_counts

* Refactoring tasks/actions

* Removing debug and fixing doctest
2019-12-06 08:46:43 +00:00
Francesco Vigliaturo
99bfea42b6
Added support for 2 date formats: (#70)
* Adds support for multiple date formats
2019-12-04 17:42:50 +01:00
Stephen Dodson
bf6c56878a Correcting license files + fixing bug in filter
LICENSE and NOTICE conform to Elastic policy. Bug in
nested negated filters fixed.

Also, some limited cleanup.
2019-12-03 13:56:49 +00:00
Michael Hirsch
a3dd86075a
String Arithmetics: __add__ ops (#68)
* adds support for __add__ ops for string objects and literals

* adds tests for string arithmetic

* updates comment in numeric field resolution

* adds op_type parameter for numeric_ops
2019-11-27 10:44:17 -05:00
Stephen Dodson
86686ebb18 Reformat and cleanup based on PyCharm 2019-11-26 11:02:46 +00:00
Stephen Dodson
ac8cb302de Updates based on PR review. 2019-11-25 12:43:37 +00:00
Stephen Dodson
84e23ab5d1 Added Series metric aggs + Series docs
Also, improved Series.to_string()
2019-11-22 15:44:55 +00:00
Stephen Dodson
5d119215f8 Fixing rename and truediv issues
tests pass
TODO - implement additional orithmetic ops
2019-11-21 20:37:54 +00:00
Stephen Dodson
c12bf9357b Series rename and arithmetic initial implementation
Partially implemented, tests fail with this commit.
2019-11-21 15:39:13 +00:00
Michael Hirsch
9c03d5a0d4 instantiates column as series with specified dtype 2019-11-19 13:13:08 -05:00
Michael Hirsch
9c9ca90c0d
Adds Support for Series.value_counts() (#49)
* adds support for series.value_counts

* adds docs for series.value_counts

* adds tests for series.value_counts

* updates keyerror language

* adds es docs as an external source

* adds parameters for metrics and terms aggs

* adds 2 tests to check for exceptions

* explains the size parameter

* removes print statements from tests

* checks that es_size is a positive integer

* implements assert_series_equal
2019-11-19 11:27:15 -05:00
Stephen Dodson
2f4d601932 Adding eland.read_csv
TODO - resolve issue with ordering of eland.DataFrame compared to csv
2019-11-15 15:14:12 +00:00
Stephen Dodson
f5025b9f39 Renamed ed_to_pd eland_to_pandas and added docs.
+ added some additions to .gitignore
+ removed DataFrame.squeeze for now
2019-11-15 11:21:27 +00:00
Stephen Dodson
dff49d01fe More doc updates. 2019-11-13 18:23:43 +00:00
Stephen Dodson
8de7a1db7d Resolved minor PyCharm issues 2019-11-05 13:31:10 +00:00
Stephen Dodson
c1ee409a33 Major cleanup - removed modin as dependency
modin removed as a dependency and iloc feature
removed for now - TODO add back in.
2019-11-04 13:13:42 +00:00
Stephen Dodson
337bef1c5d Demo day notebook + minor updates added 2019-08-15 12:26:58 +00:00
Stephen Dodson
ef289bfe78 Adding partial DataFrame.query support
Only > and == currently implemented for PoC. 'query'
language not supported yet.
2019-08-14 14:44:04 +00:00
Stephen Dodson
49bad292d3 Added DataFrame.to_csv - tests still failing 2019-08-09 07:54:44 +00:00
Stephen Dodson
c6e0c5b92b Adding smaller test and first effort to implement aggs 2019-08-06 14:58:38 +00:00
Stephen Dodson
67b7aee9c9 Adding DataFrame.hist tests and DataFrame.select_dtypes 2019-08-01 12:55:17 +00:00
Stephen Dodson
3435ffac1b Adding first implementation of eland.DataFrame.hist 2019-07-31 09:59:52 +00:00
Stephen Dodson
1fa4d3fbe7 Partial implementation of hist - does not work
Backup push
2019-07-12 15:24:32 +00:00
Stephen Dodson
d71ce9f50c Adding drop + the ability for operations to have a query
Significant refactor - needs cleanup
2019-07-11 10:11:57 +00:00
Stephen Dodson
a73c999290 iloc is (mainly) working. 2019-07-09 10:02:08 +00:00
Stephen Dodson
15e0c37182 Major refactor. eland is now backed by modin.
First push, still not functional.
2019-07-04 13:00:19 +00:00