feature: Row.allow_break_across_pages #245

AlbinoShadow · 2016-01-08T20:30:48Z

I'll try to keep it sweet and simple... There is a variable, paragraph.paragraph_format.keep_together, when it's true it will keep the paragraph on a single page instead of splitting it if it's at the end of the page.

I tried using this in a table scenario and it didn't work, although Word has the capability. After comparing the document.xml between a document where a row can split pages and one that cannot, I found that it acts as a table row property instead of a paragraph property (cantSplit vs keepLines in XML).

I can't seem to find any info regarding this within python-docx so I may make my own docx function for it, but I've got very little XML knowledge so I'd like to avoid that.

References for @scanny if he sees this:

<w:tr w:rsidR="00137A85" w:rsidTr="005B6BA3">
        <w:trPr>
          <w:cantSplit/>
        </w:trPr>

That's the XML with the row not splitting pages and this is without:

<w:tr w:rsidR="00137A85">

Obvious difference is the fact there's no table row property section and also the initial table row definition is different. If I end up making my own XML edits for this I'll make sure to post them.

AlbinoShadow · 2016-01-11T15:26:29Z

Fixed the issue following the information in this thread: #55 (comment)

Thanks for the tool @scanny it's extremely helpful =)

scanny · 2016-04-09T21:08:02Z

Reopening as feature request for Table.allow_break_across_pages. Thanks for this @AlbinoShadow :)

AlbinoShadow · 2016-04-11T17:59:12Z

@scanny here's the code that I ended up using to fix the issue:

from docx.oxml.shared import OxmlElement, qn # Necessary Import

def preventDocumentBreak(document):
  tags = document.element.xpath('//w:tr')
  rows = len(tags)
  for row in range(0,rows):
    tag = tags[row]                     # Specify which <w:r> tag you want
    child = OxmlElement('w:cantSplit')  # Create arbitrary tag
    tag.append(child)                   # Append in the new tag

I only had a single table in my document so I just applied it to every cell I believe. It's some edited code I found in another one of your comments I believe, but figured it doesn't hurt to post it. Thanks again, python-docx has made a huge difference in my job and is the reason I learned Python.

scanny · 2016-04-12T05:23:39Z

Super, thanks Joe :)

linuxkd · 2020-05-15T03:18:04Z

For those people that are looking to do the opposite and allow the row to go over multiple pages.

def allowDocumentBreak(document):
    """Allow table rows to break across pages."""
    tags = document.element.xpath("//w:tr")
    rows = len(tags)
    for row in range(0, rows):
        tag = tags[row]  # Specify which <w:r> tag you want
        child = OxmlElement("w:cantSplit")  # Create arbitrary tag
        child.set(qn("w:val"), "0")
        tag.append(child)  # Append in the new tag

vlad-belogrudov · 2022-10-17T12:55:20Z

looks like in the current format you have to have trPr tag (table row properties) for such tr. In the trPr you can specify property OxmlElement("w:cantSplit").

Rather life-hack (since it's internal api), to set "no-break" for a row:

row = table.add_row()
trPr = row._tr.get_or_add_trPr()
trPr.append(OxmlElement('w:cantSplit'))

muhammadahmadazhar · 2023-04-18T08:05:23Z

it fixed by
trPr = OxmlElement('w:trPr')
cantSplit = OxmlElement('w:cantSplit')
cantSplit.set(qn('w:val'), 'true')

trPr.append(cantSplit)
row._tr.append(trPr)

1krishnasharma · 2023-11-09T07:15:23Z

how to keep two rows of the table together. I have seen solution of splitting row. But if I want to keep two rows together such that if page end they should be in one page together.

like you can see in the image, how can i keep these two rows together in next page or previous page. anyone can help me with that?

scanny · 2023-11-09T18:21:00Z

@1krishnasharma the solutions here should work for you.

A slightly more robust implementation would be:

from docx.oxml import OxmlElement
from docx.oxml.ns import qn
from docx.table import _Row

def make_row_cant_split(row: _Row) -> None:
    tr = row._tr

    # -- if the element is already present, make sure it's turned on --
    cantSplits = tr.xpath("./w:trPr/w:cantSplit")
    if cantSplits:
        cantSplit = cantSplits[0]
        cantSplit.set(qn('w:val'), 'true')
        return

    # -- otherwise add it in bool-true state --
    trPr = tr.get_or_add_trPr()
    cantSplit = OxmlElement("w:cantSplit")
    cantSplit.set(qn('w:val'), 'true')
    trPr.insert_element_before(
        cantSplit,
        (
            "w:trHeight",
            "w:tblHeader",
            "w:tblCellSpacing",
            "w:jc",
            "w:hidden",
            "w:ins",
            "w:del",
            "w:trPrChange",
        ),
    )

ShoulddaBeenaWhaleBiologist · 2024-01-03T21:43:24Z

@scanny Thanks for the robust implementation 👍

Can I ask why it's better to use trPr.insert_element_before() vs the trPr.append() used in other examples above?
Is inserting this table row property before all those others you listed more in line with the spec? Just results in more consistently correct behavior or something?

Thanks again

scanny · 2024-01-03T21:53:55Z

@ShoulddaBeenaWhaleBiologist In general, child elements in the OpenXML schema are specified as a sequence, meaning they have a specified order. Sometimes this order matters, so placing a new element in the right position is something we always do in the library. That's what .insert_element_before() does, a longer name would be "insert this element before any of the following that already appear as a child".

.append() places the new element as the last child. Often this will work and folks do it all the time, but "working" can be client specific, so testing it with LibreOffice might work and with Word not. So I never take the chance and just put it in the right order.

AlbinoShadow closed this as completed Jan 11, 2016

scanny changed the title ~~Tables that keep cell data on a single page~~ feature: Row.allow_break_across_pages Apr 9, 2016

scanny added the table label Apr 9, 2016

scanny reopened this Apr 9, 2016

EstiShay mentioned this issue Jul 2, 2018

Prevent tables from flowing across page break consbio/salcc_blueprint2#30

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feature: Row.allow_break_across_pages #245

feature: Row.allow_break_across_pages #245

AlbinoShadow commented Jan 8, 2016

AlbinoShadow commented Jan 11, 2016

Uh oh!

scanny commented Apr 9, 2016

Uh oh!

AlbinoShadow commented Apr 11, 2016

Uh oh!

scanny commented Apr 12, 2016

Uh oh!

linuxkd commented May 15, 2020

Uh oh!

vlad-belogrudov commented Oct 17, 2022 •

edited

Loading

Uh oh!

muhammadahmadazhar commented Apr 18, 2023

Uh oh!

1krishnasharma commented Nov 9, 2023 •

edited

Loading

Uh oh!

scanny commented Nov 9, 2023

Uh oh!

ShoulddaBeenaWhaleBiologist commented Jan 3, 2024

Uh oh!

scanny commented Jan 3, 2024

Uh oh!

feature: Row.allow_break_across_pages #245

feature: Row.allow_break_across_pages #245

Comments

AlbinoShadow commented Jan 8, 2016

AlbinoShadow commented Jan 11, 2016

Uh oh!

scanny commented Apr 9, 2016

Uh oh!

AlbinoShadow commented Apr 11, 2016

Uh oh!

scanny commented Apr 12, 2016

Uh oh!

linuxkd commented May 15, 2020

Uh oh!

vlad-belogrudov commented Oct 17, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

muhammadahmadazhar commented Apr 18, 2023

Uh oh!

1krishnasharma commented Nov 9, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

scanny commented Nov 9, 2023

Uh oh!

ShoulddaBeenaWhaleBiologist commented Jan 3, 2024

Uh oh!

scanny commented Jan 3, 2024

Uh oh!

vlad-belogrudov commented Oct 17, 2022 •

edited

Loading

1krishnasharma commented Nov 9, 2023 •

edited

Loading