Header and Footer #291

eupharis · 2016-04-20T20:33:44Z

No description provided.

scanny · 2016-04-21T05:05:22Z

docx/header.py

+
+    @text.setter
+    def text(self, text):
+        raise NotImplementedError('todo')


Okay, I just revised this commit and merged it in: https://github.com/python-openxml/python-docx/commits/feature/header

Since it's your first one of these, I'm thinking it's faster that way: we can maintain momentum and I'll just note the whys and wherefores here so you can pick it up as you go :)

You'll probably want to rebase and force push after you've digested the comments so you're building on the main branch, but as long as your commits are individually mergeable without error there's some discretion on all that. I'm going to be cherry picking them and generally there are always revisions in the process. I'll leave the rebasing decision to you.

I do recommend you study the diff and understand why each change was made because they'll come up over and over. The general patterns are remarkably consistent on this project, probably because everything is just a fancy way of reading and writing to an XML file :)

All classes, functions, and methods get a full docstring at their first appearance. Follow the conventions of the rest of the library. Everyone's got a favorite; but consistency is the objective here.

Header should inherit from ElementProxy because it will wrap the <w:header> element. More on that later.

These two methods are unsupported by tests and not yet indicated by the acceptance test errors. So they come out for now. They'll get their turn in due course :)

Sounds good!

eupharis · 2016-05-02T22:35:35Z

docx/header.py

    """
-
-    __slots__ = ()


I used BlockItemContainer in my previous implementation of header and it works great. And its docstring says it's supposed to be used for headers.

It gives you all the .paragraphs and .add_paragraphs and all that jazz pretty much out of the box.

Header is a bit weird in that it needs two separate elements in two separate files to function:

<w:headerReference> in document.xml
<w:header> in header1.xml (or whatever)

In my previous implementation I just made Header handle <w:header> in header1.xml and then did this for the headerReference element:

header_ref_elm_tag = 'w:headerReference'
header_ref_elm = OxmlElement(header_ref_elm_tag, attrs=header_attrs)
# add the relationship
sectPr.insert(0, header_ref_elm)

it seemed to work pretty well.
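
For readers following along, here is a minimal sketch of the "reference element in document.xml" half of that pairing. It uses the standard library's ElementTree rather than the project's own OxmlElement helper, and the function name add_header_reference and the rId value are hypothetical, chosen only for illustration.

```python
import xml.etree.ElementTree as ET

# WordprocessingML and relationships namespaces
W = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"
R = "http://schemas.openxmlformats.org/officeDocument/2006/relationships"


def add_header_reference(sectPr, rId, hdr_type="default"):
    """Insert a w:headerReference as the first child of *sectPr* and return it.

    Hypothetical helper, not part of python-docx.
    """
    ref = ET.Element("{%s}headerReference" % W)
    ref.set("{%s}type" % W, hdr_type)
    ref.set("{%s}id" % R, rId)  # key of the relationship to the header part
    sectPr.insert(0, ref)
    return ref


# A section-properties element that already has one child
sectPr = ET.Element("{%s}sectPr" % W)
ET.SubElement(sectPr, "{%s}pgSz" % W)
ref = add_header_reference(sectPr, "rId7")
```

The r:id attribute only makes sense once a matching relationship to the header part exists in the package, which is the part-level step discussed later in the thread.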

scanny · 2016-05-04T01:51:40Z

Okay, I've added my comments on that commit. Want to have another go and I think we'll have this commit ready to merge :)

scanny · 2016-05-10T04:02:03Z

docx/header.py

 from .shared import ElementProxy


 class Header(ElementProxy):
    """
-    The default page header of a section.
+    Proxy for ``<w:hdr>`` element in this section, having primarily a
+    container role.


This commit is merged now, these comments are just for reference for future commits :)
https://github.com/python-openxml/python-docx/commits/feature/header

I left this change out. The docstring is for the library end-user (developer using python-docx) because it appears in the API documentation. So the details of what's being proxied wouldn't generally be a concern for them. I know I've used the "container role" phrase elsewhere, but I'm thinking if we want to elaborate we'd want to do so with specifics that would be helpful to a developer learning how to use this object. I'm sure we'll find more tidbits to add here as we go.

scanny · 2016-05-10T05:21:02Z

As far as next steps are concerned, it's probably time to start a new hdr-header-props.feature file and start on the header properties.

In general we'll want to stay to the "getter" side of things until that's pretty well fleshed out; the acceptance tests for the "setter" side generally use the getter side methods to validate they worked.

I'm thinking the getter aspect of Header.is_linked_to_previous is a good place to start. So something like:

Scenario Outline: Get Header.is_linked_to_previous
  Given a header <having-or-no> definition 
   Then header.is_linked_to_previous is <value>

  Examples: ...
    | having-or-no | value |
    | having a     | False |
    | having no    | True  |

That will get us down to the next step in the process, which is accessing the right header definition elements under w:sectPr.
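
As a rough illustration of what the getter might check, the sketch below assumes that "having a definition" means the section's w:sectPr has a w:headerReference child of the default type. That reading is an assumption based on the scenario table above, and the function is a standalone stand-in written with ElementTree, not the library's implementation.

```python
import xml.etree.ElementTree as ET

W = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"


def is_linked_to_previous(sectPr, hdr_type="default"):
    """Return True when *sectPr* has no w:headerReference of *hdr_type*.

    Assumption: a missing reference means the header is inherited from the
    previous section.
    """
    for ref in sectPr.findall("{%s}headerReference" % W):
        if ref.get("{%s}type" % W) == hdr_type:
            return False
    return True


with_def = ET.fromstring(
    '<w:sectPr xmlns:w="%s"><w:headerReference w:type="default"/></w:sectPr>' % W
)
without_def = ET.fromstring('<w:sectPr xmlns:w="%s"/>' % W)
```

Here with_def corresponds to the "having a" row and without_def to the "having no" row of the examples table.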

After that, I'm thinking the next step is Header.body, which is the actual block item container. Something like:

Scenario: Get Header.body
  Given a header having a definition 
   Then header.body is a BlockItemContainer object

This will need to get more sophisticated later, adding a case for a header that doesn't have a definition and may need to create one, but this will be a start.

Want to work in that direction and we'll see where we go?

eupharis · 2016-05-10T23:58:12Z

@scanny ok! I just did that acpt test and pushed it. I tried to keep it to just the acceptance test and nothing else for that commit.

Tomorrow I'll dive in to the implementation and unittests.

eupharis · 2016-06-13T17:54:30Z

@scanny Ok! I reviewed the top commit. Looks good.

I just pushed those analysis updates. I think maybe we should hide the header.body interface, like on document. So the api people actually use is just header.add_paragraph or header.paragraphs, which internally uses header._body. What do you think?

What are the next steps to code? related_hdrftr_body?

Before, to add the header relationship between hdr and headerReference, I did this:

        reltype = nsmap['r'] + '/header'
        self._parent.part.rels.add_relationship(reltype, header_part, rel_id)

So for this reading stage we'll have to interact with self._parent.part.rels and use that to get to the HeaderPart

Here's what I had for the HeaderPart before:

class HeaderPart(XmlPart):
    # MOSTLY COPYPASTA FROM DOCUMENT PART BELOW THIS POINT
    # TODO ABSTRACT?
    @property
    def next_id(self):
        """
        The next available positive integer id value in this document. Gaps
        in id sequence are filled. The id attribute value is unique in the
        document, without regard to the element type it appears on.
        """
        id_str_lst = self._element.xpath('//@id')
        used_ids = [int(id_str) for id_str in id_str_lst if id_str.isdigit()]
        for n in range(1, len(used_ids)+2):
            if n not in used_ids:
                return n
        ...
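
The gap-filling behavior described in that docstring can be tried in isolation. This is a small standalone sketch of the same idea; the function name next_available_id is hypothetical, and it takes a plain list of strings instead of running an XPath query. A set is used for membership tests, which is a swapped-in detail rather than what the snippet above does.

```python
def next_available_id(id_strs):
    """Return the smallest positive integer not present in *id_strs*.

    Non-numeric values such as 'rId3' are ignored, as in the snippet above.
    """
    used_ids = {int(s) for s in id_strs if s.isdigit()}
    # With k used ids, the answer is at most k + 1
    for n in range(1, len(used_ids) + 2):
        if n not in used_ids:
            return n


first_gap = next_available_id(["1", "2", "4", "rId3"])  # gap at 3 is filled
no_gap = next_available_id(["1", "2", "3"])  # next value after the run
empty = next_available_id([])  # nothing used yet
```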

And then in the docx/__init__.py this guy was necessary:

PartFactory.part_type_for[CT.WML_HEADER] = HeaderPart
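
One way to picture that registration is as a lookup table from content type to part class, consulted when the package is loaded. The sketch below is a simplified stand-in under that assumption: the classes and the load_part function are hypothetical and are not the library's PartFactory.

```python
class Part:
    """Generic fallback part (stand-in, not the library class)."""

    def __init__(self, partname, content_type):
        self.partname = partname
        self.content_type = content_type


class HeaderPart(Part):
    """Stand-in for a header-specific part class."""


# Content type string for a WordprocessingML header part
WML_HEADER = (
    "application/vnd.openxmlformats-officedocument.wordprocessingml.header+xml"
)

# Registry: content type -> class used to load parts of that type
part_type_for = {WML_HEADER: HeaderPart}


def load_part(partname, content_type):
    """Build a part using the registered class, falling back to Part."""
    part_cls = part_type_for.get(content_type, Part)
    return part_cls(partname, content_type)


header = load_part("/word/header1.xml", WML_HEADER)
other = load_part("/word/theme/theme1.xml", "application/xml")
```

With this shape, supporting a new kind of part is a matter of adding one registry entry, and any number of header parts with arbitrary names are handled by the same class.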

So in English what I understand this to do is something like:

"For all content types in the document.xml.rels of type WML_HEADER, interact with the actual XML file via the HeaderPart."

eupharis · 2016-07-06T16:57:16Z

Whew busy few weeks! Hope yours has been good! @scanny where are we at on this? any next steps?

scanny · 2016-07-08T23:08:31Z

Hi Dan, apologies, this one slipped down my inbox a bit :)

It looks from the current behave error that the next step is to implement DocumentPart.related_hdrftr_body().

Hiding Header.body might possibly work. I assume the behavior you mean is that it would provide access to the paragraphs or whatever of the effective header, meaning this one if defined or the inherited one if not. So .body would never be None and it would add a new header on the first section if there were no other header to inherit from.

We can probably postpone a final decision until we get further along, since I'm thinking these would just amount to API shortcuts; the "long-form" calls would still need to be implemented.

On the rels stuff, most of those operations (I think all) have been factored out into docx.opc.Part. So for the situation you mention, all you need in Header should be:

from docx.opc.constants import RELATIONSHIP_TYPE as RT
rId = self.part.relate_to(header_part, RT.HEADER)

If you look in docx.parts.styles.StylesPart.default() you can see an example of how a new part is created from scratch. SettingsPart has one too, very similarly implemented.

Let me know if you need more to go on :)

I'll wait to merge the docs updates until the next commit so you can update the page if you need to.

eupharis · 2016-07-11T19:04:01Z

Ok! Just pushed another commit.

For some reason I find all this Part stuff way more confusing than anything else in this project. I hope this commit is going in the right direction.

HeaderPart feels slightly different than NumberingPart, StylesPart, and SettingsPart in that we can have multiple HeaderParts in the same Document and they can have arbitrary PackURIs.

With this commit, if CT_HeaderFooter were done the behave test might be close to passing? Would CT_HeaderFooter be the next step?

scanny · 2016-07-12T00:13:33Z

Yes, it took me a while to get the part bits worked out. Initially the part and the API object (like document) were the same object; it's only relatively recently that I made them distinct.

The thing about a part object is that it deals with packaging concerns (and interacting with other parts). This is distinct from the API object, which deals strictly with the XML inside the part. So that's where the boundary is.

Anytime you're dealing with an rId (relationship), it's going to involve the part, because only a part can interact with another part. Also, most of those interactions are factored out into the XmlPart or Part base classes.

Hopefully that helps clarify a little bit :)
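
To make the boundary concrete, here is a minimal sketch of a part that owns its relationships and hands back an rId. The Part class below is a simplified stand-in with a relate_to method modeled loosely on the call shown earlier in the thread; its internals (a plain dict, sequential rId numbering) are assumptions for illustration only.

```python
class Part:
    """Stand-in for a package part; owns relationships to other parts."""

    def __init__(self, partname):
        self.partname = partname
        self._rels = {}  # rId -> (reltype, target part)

    def relate_to(self, target, reltype):
        """Return the rId of the relationship to *target*, adding it if needed."""
        for rId, (existing_type, part) in self._rels.items():
            if existing_type == reltype and part is target:
                return rId
        rId = "rId%d" % (len(self._rels) + 1)
        self._rels[rId] = (reltype, target)
        return rId

    @property
    def related_parts(self):
        """Mapping of rId to the related part."""
        return {rId: part for rId, (_, part) in self._rels.items()}


RT_HEADER = (
    "http://schemas.openxmlformats.org/officeDocument/2006/relationships/header"
)

document_part = Part("/word/document.xml")
header_part = Part("/word/header1.xml")
rId = document_part.relate_to(header_part, RT_HEADER)
same_rId = document_part.relate_to(header_part, RT_HEADER)  # reuses the rel
```

An API object would only ever see the rId string in its XML and ask its part to resolve it, which keeps packaging concerns out of the proxy classes.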

HeaderPart is distinct (so far) in the way that you mention in python-docx. But it's not the first time we've encountered sets of parts. In python-pptx, slide parts (and others) are multiple. So we should have all the infrastructure in place for dealing with them built into the .opc sub-package.

I'll have a look at this commit and try to get it merged this week. I'm traveling, so it will be toward the end of the week.

scanny · 2016-07-12T00:26:10Z

docx/parts/document.py

+        just adding it because it seems much easier to mock than
+        self.rels.related_parts
+        """
+        return self.rels.related_parts[rId]


Let's keep this all in one piece. Here's an example of how I mocked related_parts in python-pptx. .related_parts is available directly on the part object, no need to go out to .rels to get it :)

# in pptx.parts.presentation ------

def related_slide(self, rId):
    """
    Return the |Slide| object for the related |SlidePart| corresponding to
    relationship key *rId*.
    """
    return self.related_parts[rId].slide

# the rest of this is in tests.parts.test_presentation ------

def it_provides_access_to_a_related_slide(self, slide_fixture):
    prs_part, rId, slide_ = slide_fixture
    slide = prs_part.related_slide(rId)
    prs_part.related_parts.__getitem__.assert_called_once_with(rId)
    assert slide is slide_

@pytest.fixture
def slide_fixture(self, slide_, related_parts_prop_):
    prs_part = PresentationPart(None, None, None, None)
    rId = 'rId42'
    related_parts_ = related_parts_prop_.return_value
    related_parts_.__getitem__.return_value.slide = slide_
    return prs_part, rId, slide_

@pytest.fixture
def related_parts_prop_(self, request):
    return property_mock(request, PresentationPart, 'related_parts')

eupharis · 2016-07-12T17:26:50Z

Ok cool. Parts make a bit more sense.

The related_parts mock makes sense. I guess __getitem__ is all we need to do. That's handy. I'll do that from now on.
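
The same idea can be reproduced with only the standard library's unittest.mock, without the project's property_mock helper. The sketch below is an approximation under stated assumptions: DocumentPart here is a bare stand-in class, and related_header and check_related_header are hypothetical names.

```python
from unittest.mock import MagicMock, PropertyMock, patch


class DocumentPart:
    """Bare stand-in; the real property would consult the part's rels."""

    @property
    def related_parts(self):
        raise NotImplementedError("replaced by a mock in the test")

    def related_header(self, rId):
        # Hypothetical method under test
        return self.related_parts[rId].header


def check_related_header():
    header_ = object()
    # Replace the property on the class so instances return the mock mapping
    with patch.object(
        DocumentPart, "related_parts", new_callable=PropertyMock
    ) as related_parts_prop_:
        related_parts_ = MagicMock()
        related_parts_.__getitem__.return_value.header = header_
        related_parts_prop_.return_value = related_parts_

        header = DocumentPart().related_header("rId42")

        # Subscripting the mock is recorded on its __getitem__ attribute
        related_parts_.__getitem__.assert_called_once_with("rId42")
    return header is header_


result = check_related_header()
```

Because MagicMock supports configuring __getitem__, no real relationships collection has to be built for the test.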

Sum4196 · 2016-07-18T14:45:58Z

Hello @eupharis and @scanny, is there anything I can do to help with this?

scanny · 2016-07-18T19:00:07Z

Hi Michael, we're always happy to have help with things, the main question is whether you have the skills (or, like I was initially, the willingness to develop them quickly :). It's a pretty big library by many people's standards, and fairly complex, so it takes some time to get oriented. Also we maintain pretty high engineering standards in order to keep the library robust at this size with part-time attention.

If none of that scares you away I recommend reading through this full thread and its predecessor #264 and see what you can learn about what it takes. After that you would probably want to pick a different feature because the coordination overhead would be fairly high and it would be easy to get in each other's way (it's a fairly linear process within a particular feature), but I'll leave that call to @eupharis.

Sum4196 · 2016-07-18T19:13:01Z

Your response is much appreciated @scanny, I was reading up on this and it seemed to be more of a linear process as you say which I don't want to interfere with. I do know that I am not highly skilled at the time being and am learning as I go (I have learned A LOT reading up on this project by the way). I can possibly take a look at the full thread that you referenced, however, I'm unsure if I would be of much help. I just figured I would ask anyway :)

dariojr · 2017-02-12T23:27:14Z

Hello! The library is very useful, thanks for your work.
I would like to help develop this functionality (header and footer). I have experience with this kind of work, but I am new to collaborating on GitHub and to such high standards.
Is there any documentation you recommend to start with?
I have already read thread #291 and downloaded the latest merge to get started.
Thank you!

scanny · 2017-02-13T18:34:39Z

@dariojr There is a branch named feature/header which contains the committed work so far: https://github.com/python-openxml/python-docx/commits/feature/header

Roughly the last 11 commits on that branch are the header/footer work. You may want to take a look at those and see where you think you want to go next.

Best is probably to get the tests working on your machine as it's all test-driven development.

The unit tests run with py.test and the acceptance tests are run with behave. There is also syntax checking with flake8. Once you get those three installed, the tests are run with:

make clean && flake8 && py.test && behave --stop

from the project directory as the current working directory.

What operating system are you on?

dariojr · 2017-02-27T21:37:08Z

Hi, I work on Ubuntu 12.04 LTS. I am testing now.
Results:
root@dario:/usr/local/lib/python2.7/dist-packages/docx# make clean & flake8 & py.test && behave --stop
[1] 5703
[2] 5704
make: *** No rule to make target `clean'. Stop.
===================================================== test session starts =====================================================
platform linux2 -- Python 2.7.3, pytest-3.0.6, py-1.4.32, pluggy-0.4.0
rootdir: /usr/local/lib/python2.7/dist-packages/docx, inifile:
collected 0 items

================================================ no tests ran in 0.05 seconds =================================================
[1]- Exit 2 make clean
root@dario:/usr/local/lib/python2.7/dist-packages/docx# ./image/png.py:135:9: E731 do not assign a lambda expression, use a def
./image/png.py:146:9: E731 do not assign a lambda expression, use a def
./oxml/__init__.py:67:1: E402 module level import not at top of file
./oxml/__init__.py:70:1: E402 module level import not at top of file
./oxml/__init__.py:73:1: E402 module level import not at top of file
./oxml/__init__.py:77:1: E402 module level import not at top of file
./oxml/__init__.py:89:1: E402 module level import not at top of file
./oxml/__init__.py:98:1: E402 module level import not at top of file
./oxml/__init__.py:119:1: E402 module level import not at top of file
./oxml/__init__.py:133:1: E402 module level import not at top of file
./oxml/__init__.py:151:1: E402 module level import not at top of file
./oxml/__init__.py:184:1: E402 module level import not at top of file
./oxml/__init__.py:187:1: E402 module level import not at top of file
./oxml/__init__.py:198:1: E402 module level import not at top of file
./enum/text.py:84:1: E305 expected 2 blank lines after class or function definition, found 1
./enum/shape.py:21:1: E305 expected 2 blank lines after class or function definition, found 1
root@dario:/usr/local/lib/python2.7/dist-packages/docx#

As you can see, I installed it in dist-packages so I can also test it with my own code. But I can't get the Candidate Protocol published at the following page to work:
http://python-docx.readthedocs.io/en/latest/dev/analysis/features/header.html

        header = document.sections[0].header
        header.is_linked_to_previous
        header.text = 'foobar'
        header.is_linked_to_previous

Any suggestions on how I can continue getting familiar with this project? Thank you!

ascendedcrow · 2017-08-11T10:52:43Z

Is there a timeframe for this to go in?

tbell511 · 2017-09-18T17:52:32Z

Quick question... Why has this not been merged into another release?

khanzf · 2017-09-18T19:08:38Z

python-docx has 41 pull requests and has not updated since Jun 22, 2016.
It might be useful to fork and start a new active branch.

scanny · 2017-09-18T19:24:33Z

@formatically Work stalled on this contributor branch a while back. That often happens on contributed branches. If you want to pick it up by all means do.

@khanzf This is a test-driven project, so no commits get merged without the tests. That discourages most casual additions it seems. In my experience only a relative few Python developers are test-driven development (TDD) guys. If you want to add some features by all means open the conversation here. Or if you want to maintain a non-TDD version, fork away :) You might soon be spending your weekends chasing bugs though. That's where indiscriminate committing of pull requests without tests leads in my experience. It might be more productive to become maintainer on this project.

guerda · 2018-02-16T11:43:32Z

@scanny: I'm happy to help to push this feature to release.
What do you see as remaining to be done before this feature branch can be merged? As far as I can see, Travis has been building and testing that branch successfully. Again, I'm happy to help.

wheeled · 2018-07-24T09:45:10Z

Hi guys, is this PR working? I’d like to check it out.

Spandan-Madan · 2018-08-01T01:13:33Z

Can this be used to edit the header of a docx file?

If so, can someone please give a quick example for doing so?

Thanks a lot!

scanny · 2019-01-06T23:11:15Z

A client sponsored the headers/footers feature and it is forthcoming in the next release v0.8.8 in the next few days. @ondrej-111 I credited you with several of the commits, although the implementation is somewhat different. You can see the implementation on the spike-header branch for comparison if you like. @eupharis I credited you with the analysis and user documentation commits. Thanks to both of you for your contributions to this feature and I'm glad we're finally getting it added to the build :)

eupharis · 2019-01-07T17:40:02Z

@scanny Great news! Thanks for finishing this. Excited to try it out :)

eupharis mentioned this pull request Apr 20, 2016

Header and Footer #264

Closed

scanny reviewed Apr 21, 2016

eupharis closed this May 2, 2016

eupharis force-pushed the feature/header2 branch from d11d160 to bbbc287 Compare May 2, 2016 17:59

eupharis reopened this May 2, 2016

eupharis force-pushed the feature/header2 branch from 7f02b5d to c68f667 Compare May 2, 2016 22:32

eupharis reviewed May 2, 2016

eupharis force-pushed the feature/header2 branch 5 times, most recently from 6459a93 to 3598440 Compare May 5, 2016 16:47

scanny reviewed May 10, 2016

eupharis force-pushed the feature/header2 branch from 3598440 to c9a99c5 Compare May 10, 2016 23:53

eupharis force-pushed the feature/header2 branch from 6971215 to 3bcfd5c Compare June 13, 2016 16:46

hdr: add related_hdrftr_body

8ed60ee

eupharis force-pushed the feature/header2 branch from 5dbf678 to 8ed60ee Compare July 11, 2016 18:55

scanny reviewed Jul 12, 2016

scanny mentioned this pull request Jul 26, 2016

Feature: Paragraph.add_hyperlink() (wip) #278

Closed

3 tasks

ardyflora mentioned this pull request Mar 12, 2017

Adding header not working #373

Closed

scanny removed the active label Mar 12, 2017

Benjamin-T mentioned this pull request Sep 2, 2018

Feature/bookmarks #445

Open

scanny closed this Jan 6, 2019
