  • #154
Closed
Open
Issue created Aug 07, 2018 by Alex Theg@athegOwner

Extract math from Word

It looks like there are 2 main ways of embedding math into .docx files (other than plain text):

  1. Using the built-in equation editor. This stores equations as inline XML tags, with no binaries:
<m:oMathPara>
  <m:oMath>
  2. Using MathType, the most common math add-on for Word, which uses math binaries that need to be extracted.
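
To see which of the two cases a given document contains, something like the following could serve as a first check. It is a minimal sketch, not part of XSweet: the function name `find_math` is hypothetical, it assumes the main document part is `word/document.xml`, and it assumes embedded OLE objects sit under `word/embeddings/` as `.bin` files. Not every embedded binary is a MathType equation, so the second result is only a list of candidates.

```python
import io
import zipfile
import xml.etree.ElementTree as ET

# Namespace URI bound to the m: prefix in Word's built-in equation markup.
M_NS = "http://schemas.openxmlformats.org/officeDocument/2006/math"
W_NS = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"


def find_math(docx_file):
    """Return (count of inline m:oMath equations, names of embedded .bin parts).

    Hypothetical helper: the .bin parts are only *candidate* MathType objects.
    """
    with zipfile.ZipFile(docx_file) as z:
        root = ET.fromstring(z.read("word/document.xml"))
        equations = root.findall(f".//{{{M_NS}}}oMath")
        binaries = sorted(
            name for name in z.namelist()
            if name.startswith("word/embeddings/") and name.endswith(".bin")
        )
    return len(equations), binaries


# Usage with a tiny in-memory stand-in for a .docx (illustrative only).
buf = io.BytesIO()
with zipfile.ZipFile(buf, "w") as z:
    z.writestr(
        "word/document.xml",
        f'<w:document xmlns:w="{W_NS}" xmlns:m="{M_NS}">'
        "<m:oMathPara><m:oMath/></m:oMathPara></w:document>",
    )
    z.writestr("word/embeddings/oleObject1.bin", b"\x00")

count, binaries = find_math(buf)
print(count, binaries)
```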

Both of these should end up represented in MathML (as the standard for HTML5). For the built-in equation editor, it looks like we will have to define the mapping from Word's math tags to MathML ourselves, which could be pretty time-consuming. For MathType, we'll need to convert the binaries. @jure has made a Ruby gem that converts from MathType to MathML. We may need to rewrite it before we can use it, but it could be a helpful resource.
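
To give a feel for what defining the mapping for the first option involves, here is a rough sketch covering only three constructs: text runs (`m:r`), fractions (`m:f`), and superscripts (`m:sSup`). It is an illustration under assumptions, not the mapping XSweet uses: the function names are hypothetical, the choice between `mi` and `mn` is a crude digit test, and everything not listed is silently skipped.

```python
import xml.etree.ElementTree as ET

M_NS = "http://schemas.openxmlformats.org/officeDocument/2006/math"


def q(name):
    """Qualified ElementTree tag name for an element in the m: namespace."""
    return f"{{{M_NS}}}{name}"


def row(parent):
    """Convert the children of a Word math container into one MathML <mrow>."""
    mrow = ET.Element("mrow")
    for child in (parent if parent is not None else ()):
        node = convert(child)
        if node is not None:
            mrow.append(node)
    return mrow


def convert(el):
    """Map a single Word math element to MathML, or None if it is unmapped."""
    if el.tag == q("r"):  # text run: join its m:t pieces
        text = "".join(t.text or "" for t in el.findall(q("t")))
        out = ET.Element("mn" if text.isdigit() else "mi")
        out.text = text
        return out
    if el.tag == q("f"):  # fraction: numerator, then denominator
        out = ET.Element("mfrac")
        out.append(row(el.find(q("num"))))
        out.append(row(el.find(q("den"))))
        return out
    if el.tag == q("sSup"):  # superscript: base, then exponent
        out = ET.Element("msup")
        out.append(row(el.find(q("e"))))
        out.append(row(el.find(q("sup"))))
        return out
    return None  # property elements and unmapped constructs are skipped


def omml_to_mathml(omath):
    """Serialize one m:oMath element as a MathML string (partial mapping)."""
    math = ET.Element("math", xmlns="http://www.w3.org/1998/Math/MathML")
    math.append(row(omath))
    return ET.tostring(math, encoding="unicode")


sample = ET.fromstring(
    f'<m:oMath xmlns:m="{M_NS}"><m:f>'
    "<m:num><m:r><m:t>a</m:t></m:r></m:num>"
    "<m:den><m:r><m:t>2</m:t></m:r></m:den>"
    "</m:f></m:oMath>"
)
result = omml_to_mathml(sample)
print(result)
```

Even this small subset shows why the full mapping is time-consuming: each Word math construct (radicals, matrices, n-ary operators, delimiters, and so on) needs its own rule, plus handling for the property elements that this sketch ignores.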
