README.md 2.68 KB
Newer Older
Wendell Piez's avatar
Wendell Piez committed
1
2
3
4
# XSweet docx to html extraction and more

*Including extraction of document contents from MS Office Open XML into HTML*

Alex Theg's avatar
roadmap    
Alex Theg committed
5
6
7
8
## Roadmap

|Project            |Description                                    |In progress?   |Done     |Issue|
|:---:              |:---:                                          |:---:          |:---:    |:---:|
Alex Theg's avatar
Alex Theg committed
9
10
11
12
13
|XSweet             |Inline and class formatting extraction         |               |✔ |XSweet/XSweet#13|
|XSweet             |Capture hyperlinks from MS Word                |               |✔ |XSweet/XSweet#3|
|XSweet             |Capture end- and footnotes as linked HTML      |               |✔ |XSweet/XSweet#2, XSweet/XSweet#22|
|Editoria Typescript|Preserve note linkages for Wax                 |               |✔ |XSweet/editoria_typescript#8|
|XSweet             |Recreate Word tables as HTML                   |               |✔ |XSweet/XSweet#66|
Alex Theg's avatar
Alex Theg committed
14
|Editoria Typescript|Handle tables in Wax                           |✔       |         |     |
Alex Theg's avatar
Alex Theg committed
15
|XSweet             |Basic list HTML representation                 |               |✔ |XSweet/XSweet#106|
Alex Theg's avatar
Alex Theg committed
16
|XSweet             |Capture list type (unordered, numbered, etc.)  |✔       |         |     |
Alex Theg's avatar
Alex Theg committed
17
18
19
20
21
22
23
24
|HTMLevator         |Heading inferrer                               |               |✔ |XSweet/HTMLevator/#13|
|HTMLevator         |Heading inferencer Word style improvements     |               |         |XSweet/HTMLevator/#14|
|HTMLevator         |Plain-text output                              |               |✔ |XSweet/HTMLevator/#12|
|HTMLevator         |Section inferrer                               |               |✔ |     |
|HTMLevator         |Copyediting cleanups and mappings              |               |✔ |XSweet/editoria_typescript/issues#21|
|HTMLevator         |Support customized transformations             |               |✔ |     |
|XSweet             |Extract images to HTML; store image files      |no but priority|         |XSweet/XSweet#110|
|Editoria Typescript|Convert image references for porting to Wax    |no but priority|         |     |
Alex Theg's avatar
Alex Theg committed
25
|XSweet             |Capture Math (possibly multipe formats)        |✔ OMML supported  |         |XSweet/XSweet#154     |
Alex Theg's avatar
Alex Theg committed
26
27
|XSweet             |Support auto-generated fields                  |               |         |XSweet/XSweet#98|
|XSweet             |Support for language features                  |               |         |     |
Alex Theg's avatar
Alex Theg committed
28
|XSweet             |Support for multiple citation formats          |no but priority|         |     |
Alex Theg's avatar
roadmap    
Alex Theg committed
29

Alex Theg's avatar
Alex Theg committed
30
For full XSweet documentation, visit https://xsweet.org/xsweet-core/.
Wendell Piez's avatar
Wendell Piez committed
31

Alex Theg's avatar
Alex Theg committed
32
Check out the other XSweet tools at https://gitlab.coko.foundation/XSweet.