104 Commits

Author SHA1 Message Date
Josh Devins
463edf9f90
Fixes field caps query params exceeding HTTP limit
We want all fields from _field_caps so we just use the wildcard expression here.
It's not clear why it didn't work before but it does now. This alleviates the
need to pass all fields to _field_caps.

See: https://github.com/elastic/eland/issues/52
2019-11-19 14:08:57 +01:00
Stephen Dodson
2854eecbf4 https://github.com/elastic/eland/issues/50 fix.
Root cause was incorrect filtering of numeric values by columns.
Added more tests + fixed a doctest.
2019-11-19 12:01:53 +00:00
Stephen Dodson
9b4fe40305 Updating docs + added supported methods doc 2019-11-19 10:42:23 +00:00
Stephen Dodson
fb2a1fae7b Updated to_string/to_html docs 2019-11-18 15:27:43 +00:00
Stephen Dodson
327f43d912 Fixing issue in to_html/to_string if max_rows is set 2019-11-18 14:47:35 +00:00
Stephen Dodson
d92ed94ef0 Improve to_string/to_html/__repr__/_repr_html_ tests
Added more rigorious tests for string representation
and fixing issue with to_html.
2019-11-18 12:55:23 +00:00
stevedodson
c93d07981d
Merge pull request #47 from blaklaybul/trim-display
Tabular Display: show 10 rows if index is larger than max_rows
2019-11-18 10:35:03 +01:00
Michael Hirsch
4eea073305 includes assertion 2019-11-16 11:57:25 -05:00
Michael Hirsch
8b96f1fa2c removes errant space 2019-11-15 17:41:15 -05:00
Michael Hirsch
30d307bdaf implements min rows to truncate display for large results 2019-11-15 17:38:46 -05:00
Michael Hirsch
b0be68e1db tabular display: show 10 rows if index is larger than max_rows 2019-11-15 11:10:35 -05:00
Stephen Dodson
2f4d601932 Adding eland.read_csv
TODO - resolve issue with ordering of eland.DataFrame compared to csv
2019-11-15 15:14:12 +00:00
Stephen Dodson
f5025b9f39 Renamed ed_to_pd eland_to_pandas and added docs.
+ added some additions to .gitignore
+ removed DataFrame.squeeze for now
2019-11-15 11:21:27 +00:00
Stephen Dodson
fb2774e7c7 Adding Series.isin 2019-11-14 20:25:34 +00:00
Stephen Dodson
5a546577f4 Resolving DataFrame.query issues + more docs 2019-11-14 20:04:38 +00:00
Stephen Dodson
8b3c66a5c8 Changing LICENSE to Apache2
Changing LICENSE to Apache2 + cleaning up setup.py
2019-11-14 09:27:44 +00:00
Stephen Dodson
dff49d01fe More doc updates. 2019-11-13 18:23:43 +00:00
Stephen Dodson
e181476dfe First effort at tidying up docs. Still work-in-progress. 2019-11-12 20:26:59 +00:00
Stephen Dodson
8de7a1db7d Resolved minor PyCharm issues 2019-11-05 13:31:10 +00:00
Stephen Dodson
c1ee409a33 Major cleanup - removed modin as dependency
modin removed as a dependency and iloc feature
removed for now - TODO add back in.
2019-11-04 13:13:42 +00:00
Stephen Dodson
9dad8613d3 Fixing tests, and upgrading to pandas 0.25.1 2019-10-18 08:06:07 +00:00
Stephen Dodson
315d4c3287 Resolving some issues with import dependencies 2019-10-08 14:39:24 +00:00
Stephen Dodson
337bef1c5d Demo day notebook + minor updates added 2019-08-15 12:26:58 +00:00
Stephen Dodson
ef289bfe78 Adding partial DataFrame.query support
Only > and == currently implemented for PoC. 'query'
language not supported yet.
2019-08-14 14:44:04 +00:00
Stephen Dodson
49bad292d3 Added DataFrame.to_csv - tests still failing 2019-08-09 07:54:44 +00:00
Stephen Dodson
c6e0c5b92b Adding smaller test and first effort to implement aggs 2019-08-06 14:58:38 +00:00
Stephen Dodson
67b7aee9c9 Adding DataFrame.hist tests and DataFrame.select_dtypes 2019-08-01 12:55:17 +00:00
Stephen Dodson
3435ffac1b Adding first implementation of eland.DataFrame.hist 2019-07-31 09:59:52 +00:00
Stephen Dodson
1fa4d3fbe7 Partial implementation of hist - does not work
Backup push
2019-07-12 15:24:32 +00:00
Stephen Dodson
9bf3505b7e Cleanup + removed dependence of elasticsearch-dsl 2019-07-11 10:17:04 +00:00
Stephen Dodson
ee9b621891 Removing redundant test 2019-07-11 10:13:00 +00:00
Stephen Dodson
d71ce9f50c Adding drop + the ability for operations to have a query
Significant refactor - needs cleanup
2019-07-11 10:11:57 +00:00
Stephen Dodson
dc07285aa1 Merge branch 'master' of github.com:stevedodson/eland 2019-07-09 10:03:08 +00:00
Stephen Dodson
a73c999290 iloc is (mainly) working. 2019-07-09 10:02:08 +00:00
stevedodson
e87bfea709
Update dataframe_supported.rst 2019-07-05 07:15:33 +00:00
Stephen Dodson
245def48e9 Added docs/dataframe_supported.rst 2019-07-04 19:38:26 +00:00
Stephen Dodson
d0ea715c31 Added test data and additional test cases 2019-07-04 19:25:47 +00:00
Stephen Dodson
15e0c37182 Major refactor. eland is now backed by modin.
First push, still not functional.
2019-07-04 13:00:19 +00:00
Stephen Dodson
5e10b2e818 Checkpoint code before attempting major investigation into using modin 2019-07-03 09:49:58 +00:00
Stephen Dodson
30df901fce Introduction of eland.Series - big refactor
Creation of eland.NDFrame as base class for DataFrame and Series
2019-07-01 18:41:56 +00:00
Stephen Dodson
c4d2683743 Adding eland.Index features 2019-06-28 14:43:20 +00:00
Stephen Dodson
428f35f21f Added DataFrame.info() + more methods 2019-06-26 12:25:08 +00:00
Stephen Dodson
9030f84f4c Added __getitem__
Implementation copies DataFrame and changes underlying mappings
object.
2019-06-25 08:41:25 +00:00
Stephen Dodson
d4250640f1 Adding __repr__ method based on SparseDataFrames 2019-06-24 13:34:06 +00:00
Stephen Dodson
c723633526 Resolving merge issue 2019-06-22 06:55:30 +00:00
stevedodson
9c61a71a81
Merge branch 'master' into feature/pep8ify 2019-06-22 08:50:05 +02:00
Stephen Dodson
5aff6966c3 Re-add DataFrame.describe() + tests 2019-06-21 14:38:29 +00:00
Stephen Dodson
0fa3f81bbb Major refactor
Addition of new mapping module to manage mapping
between pandas and ES. More tests.
2019-06-21 10:23:58 +00:00
Stephen Dodson
674ac129e6 Renaming modules and added mapping store 2019-06-18 11:48:56 +00:00
Winterflower
52cf04a97f Renaming modules to lowercase 2019-06-18 10:54:26 +02:00