Skip to content
GitLab
Projects Groups Snippets
  • /
  • Help
    • Help
    • Support
    • Community forum
    • Submit feedback
  • Register
  • Sign in
  • XSweet XSweet
  • Project information
    • Project information
    • Activity
    • Labels
    • Members
  • Repository
    • Repository
    • Files
    • Commits
    • Branches
    • Tags
    • Contributors
    • Graph
    • Compare
    • Locked Files
  • Issues 52
    • Issues 52
    • List
    • Boards
    • Service Desk
    • Milestones
    • Iterations
    • Requirements
  • Merge requests 2
    • Merge requests 2
  • CI/CD
    • CI/CD
    • Pipelines
    • Jobs
    • Schedules
    • Test Cases
  • Deployments
    • Deployments
    • Environments
    • Releases
  • Monitor
    • Monitor
    • Incidents
  • Analytics
    • Analytics
    • Value stream
    • CI/CD
    • Code review
    • Insights
    • Issue
    • Repository
  • Wiki
    • Wiki
  • Activity
  • Graph
  • Create a new issue
  • Jobs
  • Commits
  • Issue Boards
Collapse sidebar
  • XSweetXSweet
  • XSweetXSweet
  • Issues
  • #34
Closed
Open
Issue created Oct 18, 2016 by Alex Theg@athegOwner

Superscripts

The Word doc for chapter 1 of the Berry book - "b01_Chapter1" - shows the "th" part of "13th and 18th" as superscripts in the 1st paragraph below the heading.

After the initial extraction, the "th"s are wrapped inside <vertalign> tags. The scrub step changes that to a span, and the join elements step wraps that into the surrounding p tag. So, the vertaligns disappear and the superscripts do not come through into the HTML.

It looks like Word uses the vertalign for superscripts and probably subscripts too. Can we catch this and carry it over into the HTML? I wonder if there are other ways Word implements sub and supercripts.

Assignee
Assign to
Time tracking