dump/load sessions with non-arrays (hdf, pickle) #153

gdementen · 2017-03-16T09:44:32Z

I am thinking mostly of groups and scalars.

This would be especially important for users of the standalone interface (see #88).

When using the standalone interface, they will probably want to also dump functions defined within the console in the same file, so that they can start working exactly where they left of, but we could use a specific format for that (e.g pickle the whole session), but in that case, using from larray import * will become problematic.

The text was updated successfully, but these errors were encountered:

gdementen · 2017-03-21T08:48:07Z

This might be a crazy idea, but, if we want to save python functions inside an .xlsx, we could place them inside macros, potentially wrapped inside xlwings RunPython so that they are even executable from within Excel. For maintainability and collaboration it is probably better to save the python file externally (so that you can put it on a VCS, but there might be cases where this could be useful.

In that case, this could be useful:
http://stackoverflow.com/questions/17197259/use-python-to-inject-macros-into-spreadsheets

gdementen · 2017-05-04T11:02:59Z

We might want to use something like https://github.com/h5io/h5io or https://github.com/telegraphic/hickle

gdementen · 2017-05-04T11:07:56Z

Or, we could roll our own stuff. If we go that route and want to continue using Pandas HDFStore, this might be useful:

http://stackoverflow.com/questions/29129095/save-additional-attributes-in-pandas-dataframe/29130146#29130146

In [172]: df = pd.DataFrame(np.random.randn(8,3))
In [173]: store = pd.HDFStore('test.h5')
In [174]: store.put('df',df)
# you can store an arbitrary python object via pickle
In [175]: store.get_storer('df').attrs.my_attribute = dict(A = 10)
In [176]: store.get_storer('df').attrs.my_attribute
Out[176]: {'A': 10}

gdementen · 2017-09-20T11:08:50Z

Here are links to stuff that might help (or not). This is a dump of all the browser windows I have been keeping open for months. ;-)

standard pickle (if using a low enough protocol) are guaranteed to work across Python versions. Marshal (used to create .pyc) on the other hand is not.
to pickle tracebacks: https://github.com/ionelmc/python-tblib
better pickles (pickle more kinds of objects than standard pickle):
- cloudpickle: https://github.com/cloudpipe/cloudpickle
- dill:
  - https://pypi.python.org/pypi/dill
  - http://trac.mystic.cacr.caltech.edu/project/pathos/wiki/dill.html
  - of note is: dill also provides the capability to save and load python interpreter sessions
jsonpickle (serialize most python objects as json -- but with similar security implications than pickle!). The format is nicer than pickle, but I am unsure it brings any value to the table, if we do not plan to exchange sessions across languages.
https://jsonpickle.github.io/
teleport: JSON with "types"/predefined structure. An nice alternative if we only want to serialiaze data.
http://www.teleport-json.org/python/
signed pickles (python2 only): http://trustedpickle.sourceforge.net/
this works by generating a set of private/public keys and signing outgoing pickles with the private key and checking the sender is "trusted" by using the public key. This obviously does not prevent an attacker to sign his code, so it wouldn't help for making the usual case (here is my session, could you try it?) safer. Given that it would probably be "tedious" to setup for users, it will probably not be worth the trouble for a looong time. But the idea is interesting nevertheless, if we need to regularly exchange data with the same external users and want to provide some security.

alixdamman · 2018-04-06T14:32:51Z

@gdementen What is the difference between this issue and #578?

If the difference is to handle objects that are not LArray, Axis and Group (like session's metadata), I suggest to rename this issue as "dump/load sessions with metadata"?

gdementen · 2018-04-06T14:55:55Z

#578 is about the same thing as this issue except that it is more limited. This issue is about saving an interactive session to disk, close the editor, then load it back and continue working. Issue #578 will indeed bring us closer to that goal but not all the way to it. We need at least scalars and "simple python structures" of supported types to get there. Possibly functions/arbitrary objects too, but that can come later.

Note that neither issue speak about session metadata. I think I never thought about that but users will indeed want this at some point, so please create an issue for this (to be done after or at the same time than metadata for LArray #78 and #79).

alixdamman · 2018-04-06T15:04:29Z

To tell the truth, I'm thinking to try to implement a Jupyterlab extension for LArray and then abandon the editor (not soon).

OK to create a new issue --> see #615

gdementen added the enhancement label Mar 16, 2017

gdementen added the component: excel label May 4, 2017

This was referenced May 4, 2017

Save, SaveAs and Open should handle non-LArray objects. File formats which cannot save everything should be handled with Import and Export #241

Closed

ability to save/restore full viewer state #216

Open

gdementen mentioned this issue Jun 1, 2017

Session.save fails when it contains 0d arrays #291

Closed

alixdamman mentioned this issue Jun 13, 2017

fix #291 + #293 + #313 : Session.save (0D arrays + Excel + overwrite file by default) #312

Merged

alixdamman added this to the 0.29 milestone Feb 12, 2018

alixdamman mentioned this issue Feb 12, 2018

Allow to save and load all Axis and Group objects of a session in/from HDF, CSV and EXCEL files #578

Closed

alixdamman modified the milestones: 0.29, nice_to_have Apr 6, 2018

gdementen added the priority: low label Aug 1, 2019

gdementen removed this from the nice_to_have milestone Aug 1, 2019

alixdamman added this to the nice_to_have milestone Oct 10, 2019

alixdamman changed the title ~~dump/load sessions with non-arrays~~ dump/load sessions with non-arrays (hdf, pickle) Nov 29, 2019

alixdamman removed the component: excel label Nov 29, 2019

gdementen mentioned this issue Jan 7, 2020

include scalars when dumping/loading sessions (hdf, pickle) #842

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

dump/load sessions with non-arrays (hdf, pickle) #153

dump/load sessions with non-arrays (hdf, pickle) #153

gdementen commented Mar 16, 2017

gdementen commented Mar 21, 2017 •

edited

Loading

gdementen commented May 4, 2017

gdementen commented May 4, 2017

gdementen commented Sep 20, 2017 •

edited

Loading

alixdamman commented Apr 6, 2018

gdementen commented Apr 6, 2018

alixdamman commented Apr 6, 2018 •

edited

Loading

dump/load sessions with non-arrays (hdf, pickle) #153

dump/load sessions with non-arrays (hdf, pickle) #153

Comments

gdementen commented Mar 16, 2017

gdementen commented Mar 21, 2017 • edited Loading

gdementen commented May 4, 2017

gdementen commented May 4, 2017

gdementen commented Sep 20, 2017 • edited Loading

alixdamman commented Apr 6, 2018

gdementen commented Apr 6, 2018

alixdamman commented Apr 6, 2018 • edited Loading

gdementen commented Mar 21, 2017 •

edited

Loading

gdementen commented Sep 20, 2017 •

edited

Loading

alixdamman commented Apr 6, 2018 •

edited

Loading