221 Commits

Author SHA1 Message Date
stevedodson
fa930b6cea
7.6.0a2 (#130)
* Updating test matrix for 7.6 + removing oss for now.

* Resolving 7.6.0 docs issues

* Updating ML docs

* Bumping version following doc fixes
7.6.0a2
2020-02-15 20:10:41 +01:00
stevedodson
163d18d84e
Updating ML docs (#129)
* Updating test matrix for 7.6 + removing oss for now.

* Resolving 7.6.0 docs issues

* Updating ML docs
2020-02-15 19:52:04 +01:00
stevedodson
1cfcd0ab2b
Resolving docs issues (#128)
* Updating test matrix for 7.6 + removing oss for now.

* Resolving 7.6.0 docs issues
2020-02-15 19:37:41 +01:00
stevedodson
404e658a26
Updating test matrix for 7.6 + removing oss for now. (#127) 2020-02-15 18:48:17 +01:00
stevedodson
b535e69b92
Updating to 7.6.0a1 (#126) 7.6.0a1 2020-02-15 16:14:48 +01:00
stevedodson
7c1c2945a7
ML add external models (#125)
* Partial implementation of ml.ExternalModel

* Adding eland.ml.ExternalMLModel

More testing to be added + more support for MLModels
2020-02-15 15:54:29 +01:00
stevedodson
4ac67a73ea
Bumping version (#123) 7.5.1a4 2020-02-05 09:59:54 +00:00
stevedodson
c5f5d00bb0
Adding support for df['timestamp'].min() etc. (#122)
pandas and eland still differ for min/max and similar
aggregations, because pandas also supports these
on strings.
2020-01-30 11:03:37 +00:00
stevedodson
2ca538c49d
Feature/show progress (#120)
* Adding show_progress debug option to eland_to_pandas
2020-01-29 12:59:48 +00:00
stevedodson
409cb043c8
Refactoring of plotting + fixes for multiple charts (#117)
* Refactoring of plotting + fixes for multiple charts

Updates to plotting in line with pandas 0.25.3
Enables plotting of multiple histograms on the
same figure.

* Fix to setup.py to allow submodules

+ reformat of code and better Series.hist docs
2020-01-29 07:07:56 +00:00
stevedodson
46b428d59b
Improved read_csv docs + made 'to_eland' params consistent (#114)
* Improved read_csv docs + made 'to_eland' params consistent

Note: this will change the API.

* Removing additional args from pytest.

doctests + nbval tests in the CI are not addressed by
this PR.
2020-01-16 10:17:49 +00:00
stevedodson
1914644f93
Improve docs (#113)
* Adding more examples

* Adding more examples to README.md + pypi first page.

* Updated README.md
7.5.1a3
2020-01-13 15:32:41 +00:00
stevedodson
86c51dc267
Fix licensing headers (#112)
* Minor fixes for readthedocs compatibility.

* Adding doc templates

* Setting first version to 7.5.1
2020-01-13 11:54:43 +00:00
stevedodson
db3bb02335
Rename LICENSE to LICENSE.txt 2020-01-13 11:42:20 +00:00
stevedodson
277a52a242
Update LICENSE 2020-01-13 11:41:43 +00:00
stevedodson
2f87ca5901
Delete LICENSE.txt (#111)
* Delete LICENSE.txt

* Create LICENSE
2020-01-13 11:26:11 +00:00
stevedodson
5995e11bfd
Update README.md 2020-01-13 10:22:42 +00:00
stevedodson
a4736150f6
Update README.md 2020-01-13 09:01:34 +00:00
stevedodson
d7207bab3b
7.5.1a2 (#110)
* Updating README.md

* New version

* Fixing description for pypi
7.5.1a2
2020-01-10 15:40:15 +00:00
stevedodson
00fb775d29
Feature/versioning (#109)
* Minor fixes for readthedocs compatibility.

* Adding doc templates

* Setting first version to 7.5

* Resolving pypi issues + minor docs
7.5.1a1
2020-01-10 14:38:56 +00:00
stevedodson
f93b893f9d
Setting version number to valid version (#108)
* Minor fixes for readthedocs compatibility.

* Adding doc templates

* Setting first version to 7.5
2020-01-10 11:47:52 +00:00
stevedodson
1c772d0e50
More readthedocs fixes. (#107)
* Minor fixes for readthedocs compatibility.

* Adding doc templates
2020-01-10 11:33:51 +00:00
stevedodson
1d273ae465
Update README.md 2020-01-10 11:13:29 +00:00
stevedodson
679f8f4170
Minor fixes for readthedocs compatibility. (#106) 2020-01-10 11:02:51 +00:00
stevedodson
c3c2f8a020
Minor updates to README.md + merge fixes (#105) 2020-01-10 09:26:13 +00:00
stevedodson
a3293168a1
Feature/filtered hist (#104)
* Adding python 3.5 compatibility.

Main issue is ordering of dictionaries.

* Updating notebooks with 3.7 results.

* Removing temporary code.

* Defaulting to OrderedDict for python 3.5 + lint all code

All code reformatted by PyCharm and inspection results analysed.

* Adding support for multiple arithmetic operations.

Added new 'arithmetics' file to manage this process.
More tests to be added + cleanup.

* Significant refactor to arithmetics and mappings.

Work in progress. Tests don't pass.

* Major refactor to Mappings.

Field name mappings were stored in different places
(Mappings, QueryCompiler, Operations) and needed to
be kept in sync.

With the addition of complex arithmetic operations
this became complex and difficult to maintain. Therefore,
all field naming is now in 'FieldMappings' which
replaces 'Mappings'.

Note this commit removes the cache for some of the
mapped values and so the code is SIGNIFICANTLY
slower on large indices.

In addition, the addition of date_format to
Mappings has been removed. This again added more
unnecessary complexity.

* Adding OrderedDict for 3.5 compatibility

* Fixes to ordering issues with 3.5

* Adding simple cache for mappings in flatten

Improves performance significantly on large
datasets (>10000 rows).

* Adding updated notebooks (new info_es).

All tests (doc + nbval + pytest) pass.

* Fixing issue with non-zero offset histograms.
2020-01-10 08:17:45 +00:00
stevedodson
903fbf0341
Feature/mapping cache (#103)
* Adding python 3.5 compatibility.

Main issue is ordering of dictionaries.

* Updating notebooks with 3.7 results.

* Removing temporary code.

* Defaulting to OrderedDict for python 3.5 + lint all code

All code reformatted by PyCharm and inspection results analysed.

* Adding support for multiple arithmetic operations.

Added new 'arithmetics' file to manage this process.
More tests to be added + cleanup.

* Significant refactor to arithmetics and mappings.

Work in progress. Tests don't pass.

* Major refactor to Mappings.

Field name mappings were stored in different places
(Mappings, QueryCompiler, Operations) and needed to
be kept in sync.

With the addition of complex arithmetic operations
this became complex and difficult to maintain. Therefore,
all field naming is now in 'FieldMappings' which
replaces 'Mappings'.

Note this commit removes the cache for some of the
mapped values and so the code is SIGNIFICANTLY
slower on large indices.

In addition, the addition of date_format to
Mappings has been removed. This again added more
unnecessary complexity.

* Adding OrderedDict for 3.5 compatibility

* Fixes to ordering issues with 3.5

* Adding simple cache for mappings in flatten

Improves performance significantly on large
datasets (>10000 rows).

* Adding updated notebooks (new info_es).

All tests (doc + nbval + pytest) pass.
2020-01-10 08:12:03 +00:00
stevedodson
efe21a6d87
Feature/arithmetic ops (#102)
* Adding python 3.5 compatibility.

Main issue is ordering of dictionaries.

* Updating notebooks with 3.7 results.

* Removing temporary code.

* Defaulting to OrderedDict for python 3.5 + lint all code

All code reformatted by PyCharm and inspection results analysed.

* Adding support for multiple arithmetic operations.

Added new 'arithmetics' file to manage this process.
More tests to be added + cleanup.

* Significant refactor to arithmetics and mappings.

Work in progress. Tests don't pass.

* Major refactor to Mappings.

Field name mappings were stored in different places
(Mappings, QueryCompiler, Operations) and needed to
be kept in sync.

With the addition of complex arithmetic operations
this became complex and difficult to maintain. Therefore,
all field naming is now in 'FieldMappings' which
replaces 'Mappings'.

Note this commit removes the cache for some of the
mapped values and so the code is SIGNIFICANTLY
slower on large indices.

In addition, the addition of date_format to
Mappings has been removed. This again added more
unnecessary complexity.

* Adding OrderedDict for 3.5 compatibility

* Fixes to ordering issues with 3.5
2020-01-10 08:05:43 +00:00
Martijn Laarman
617583183f
Move to latest .ci script structure (#101)
Introduces a dedicated `run-repository.sh` for the repository-specific
bits.

This makes it easier to keep `run-elasticsearch.sh` and `run-tests` in sync
through file copying or patches.
2020-01-09 11:18:56 +01:00
stevedodson
bdaea4658c
Fixing addition repr test for python 3.5. (#100) 2019-12-12 15:57:52 +01:00
Camilla
a5380813a7
Adds Python 3.8 support (#96)
* Adds build status sticker to README

* Adds Python version to test matrix

* Adds debug echo message

* Adds back Python 3.5.3 to test matrix

* Adds Python version to test matrix

* Adds back Python 3.5.3 to test matrix

* Adds Python 3.8 to test matrix
2019-12-12 14:57:52 +01:00
stevedodson
5a3c73ea54
Feature/info es fix (#99)
* Resolving inconsistent __repr__ test on python 3.5

* Fixing layout for info_es + adding Series.hist doc
2019-12-12 14:36:56 +01:00
stevedodson
4bb73215a0
Resolving inconsistent __repr__ test on python 3.5 (#98) 2019-12-12 12:51:29 +01:00
Michael Hirsch
79fdb1727e
Add Support for Series Histograms (#95)
* add support for series plotting
* update docs for series plotting support
* add tests for series plotting
* fix typo
* adds comment to ed_hist_series
2019-12-11 14:51:47 -05:00
Camilla
40a584dcc2
Adds build status sticker to README and runs test on different Python versions (#84)
* Adds build status sticker to README

* Adds Python version to test matrix

* Adds debug echo message

* Adds back Python 3.5.3 to test matrix
2019-12-11 15:41:34 +01:00
stevedodson
c5730e6d38
Feature/python 3.5 (#93)
* Adding python 3.5 compatibility.

Main issue is ordering of dictionaries.

* Updating notebooks with 3.7 results.

* Removing temporary code.

* Defaulting to OrderedDict for python 3.5 + lint all code

All code reformatted by PyCharm and inspection results analysed.
2019-12-11 14:27:35 +01:00
stevedodson
9a2d55f3c8
Feature/pandas.0.25.3 (#92)
* Resolving pandas link

* Removing temporary file
2019-12-10 19:22:27 +01:00
stevedodson
e8a0fbb9f3
Feature/pandas.0.25.3 (#91)
* Added example notebooks + pytest for these notebooks

* Fixed paths

* Fixing link in docs

* Minor update for pandas 0.25.3

* Updates for pandas 0.25.3

* Fixing doc links with pandas 0.25.3 update.

* Reverting overwrite to changes to notebooks.
2019-12-10 16:05:37 +01:00
stevedodson
133b227b93
Added example notebooks + pytest for notebooks (#87)
* Added example notebooks + pytest for these notebooks

* Fixed paths

* Fixing link in docs

* Adding cleaner demo_notebook
2019-12-10 15:27:13 +01:00
stevedodson
206276c5fa
Adding Apache 2 copyright header to all .py files (#86) 2019-12-06 09:44:05 +00:00
stevedodson
f06219f0ec
Feature/refactor tasks (#83)
* Significant refactor of task list in operations.py

Classes based on composite pattern replace tuples for
tasks.

* Addressing review comments for eland/operations.py

* Minor update to review fixes

* Minor fix for some better handling of non-aggregatable fields: https://github.com/elastic/eland/issues/71

* Test for non-aggregatable value_counts

* Refactoring tasks/actions

* Removing debug and fixing doctest
2019-12-06 08:46:43 +00:00
Michael Hirsch
f263e21b8a
Better Handling of Non Aggregatable Fields (#85)
* updates ecommerce mapping to include non-aggregatable text field

* updates exists tests and adds new tests for non-aggregatable field

* better handling of non-aggregatable fields

* fixes formatting

* swaps series in assertion

* adds newline
2019-12-06 08:20:09 +00:00
Francesco Vigliaturo
99bfea42b6
Added support for 2 date formats: (#70)
* Adds support for multiple date formats
2019-12-04 17:42:50 +01:00
stevedodson
13141645f7
Merge pull request #82 from stevedodson/feature/fix_nested_not_filters
Feature/fix nested not filters
2019-12-03 15:32:31 +00:00
Stephen Dodson
1423aaad2d
Adding minor fixes to last PR 2019-12-03 14:07:05 +00:00
Stephen Dodson
57857277cd
Merge remote-tracking branch 'upstream/master' into feature/fix_nested_not_filters 2019-12-03 14:03:03 +00:00
Stephen Dodson
bf6c56878a
Correcting license files + fixing bug in filter
LICENSE and NOTICE conform to Elastic policy. Bug in
nested negated filters fixed.

Also, some limited cleanup.
2019-12-03 13:56:49 +00:00
Camilla
b56588463c
Merge pull request #81 from Winterflower/pull-request-job
Refactors tests and adds config to fix SSL issue
2019-12-03 14:20:01 +01:00
Winterflower
3e82d43351
Merge branch 'pull-request-job' of https://github.com/Winterflower/eland into pull-request-job 2019-12-02 20:32:30 +01:00
Winterflower
10e1adb680
Removes code duplication in test code 2019-12-02 20:31:53 +01:00