Parser adds <br> tag to every paragraph

dnrahamim · September 26, 2018, 1:49pm

When I supply supply a content div which contains a paragraph with trailing line breaks, then I parse that div according to the demo schema, then I use the resulting content to create an EditorView, the HTML produced by the EditorView contains an extra line break

Start with this content

<div id="content">
  <p>
    hello
    <br>
    <br>
  </p>
</div>

Parse the content let startDoc = DOMParser.fromSchema(demoSchema).parse(content)
Create a new EditorView let view = new EditorView(document.querySelector("#editor"), { …
Editor has a third line break that wasn’t in the original HTML

<div contenteditable="true" class="ProseMirror ProseMirror-example-setup-style">
  <p>
    hello
    <br>
    <br>
    <br>
  </p>
</div>

Anyone have this issue before? Any idea the best way to fix it? Thank you!

marijn · September 26, 2018, 2:24pm

The third break is just there in the editor view to make the second break visible. It isn’t part of the document and you probably shouldn’t worry about it.

dnrahamim · September 26, 2018, 2:42pm

@marijn thank you for your quick reply! okeydokey, i understand.

however! the reason this was causing me trouble is that, for my application, i’ve been saving the HTML contents of my editor to my database, then parsing them back in when I want to continue working on my document. this causes the document to slowly accumulate line breaks as I save and refresh.

is it not conventional to save the HTML of my document and read it back in? should I be exclusively saving JSON and reading JSON?

dnrahamim · September 26, 2018, 5:37pm

I made this change to addTextBlockHacks() which seems to solve my problem.

Hopefully this is not a bad idea!

addTextblockHacks() {
    let lastChild = this.top.children[this.index - 1]
    while (lastChild instanceof MarkViewDesc) lastChild = lastChild.children[lastChild.children.length - 1]

    if(!lastChild || // Empty textblock
      lastChild.node.type.name !== "hard_break") { // Don't double up on line breaks
      if (!(lastChild instanceof TextViewDesc) ||
          /\n$/.test(lastChild.node.text)) {
        if (this.index < this.top.children.length && this.top.children[this.index].matchesHack()) {
          this.index++
        } else {
          let dom = document.createElement("br")
          this.top.children.splice(this.index++, 0, new BRHackViewDesc(this.top, nothing, dom, null))
          this.changed = true
        }
      }
    }
  }

marijn · September 27, 2018, 7:35am

Definitely a bad idea.

And so is taking the innerHTML from the editor. Run the document through a DOMSerializer instead and save that HTML—that’ll be the actual document representation, rather than whatever the editor needs to display to make browsers behave.

philippkuehn · September 27, 2018, 11:04am

In case you need it: This is my function to get HTML from prosemirror:

getHTML() {
	const div = document.createElement('div')
	const fragment = DOMSerializer
		.fromSchema(this.schema)
		.serializeFragment(this.state.doc.content)

	div.appendChild(fragment)

	return div.innerHTML
}

dnrahamim · September 27, 2018, 6:01pm

Excellent, thank you!

@philippkuehn I’ll give your function a shot

eojina · December 5, 2019, 4:29pm

Your code has helped me a lot. Great job!!!

ligne13 · December 9, 2019, 9:12am

Hi @marijn, I don’t understand why it needs to add a third break. The 2 first breaks are already visible. Adding a third break adds a visual third break which is not what I want (I only want to see 2 breaks).

I wan not able to fix this on my side. The editor keeps adding breaks that are visible.

herrdu · July 16, 2020, 12:14pm

我也采用了你的方法

herrdu · September 3, 2020, 4:45am

由于修改了 addTextblockHacks 方法，可能造成了编辑出现错乱，因此将修改方法改为 setContent 时候进行，代码如下

      Array.prototype.forEach.call(pList, (child: HTMLParagraphElement) => {
    console.log('child', child);
    if (child.children.length === 1) {
      let lastChild = child.children[0];

      while (lastChild.children.length === 1) {
        lastChild = lastChild.children[0];
      }

      if (lastChild.nodeName === 'BR' && /^\n+$/.test(child.textContent || '')) {
        child.setAttribute('style', '');
        child.innerHTML = '';
      }
    }
  });