src/sage/misc/sagedoc.py: Add latex to unicode mappings #35493

mkoeppe · 2023-04-13T05:32:16Z

📚 Description

Docstrings formatted for the terminal, in uses like CliffordAlgebra?, are difficult to read when heavy LaTeX markup is used.
We add some mappings that replace LaTeX commands by Unicode characters.
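
For readers who want a picture of the idea, here is a minimal sketch of a table-driven substitution. It is not the code in this PR: the table `LATEX_TO_UNICODE`, its entries, and the function name `latex_to_unicode` are all made up for illustration.

```python
import re

# Hypothetical table, for illustration only; the mappings in the PR may differ.
LATEX_TO_UNICODE = {
    r"\otimes": "⊗",
    r"\oplus": "⊕",
    r"\wedge": "∧",
    r"\lambda": "λ",
    r"\in": "∈",
}

# One alternation over all commands, longest first.  The trailing
# (?![A-Za-z]) requires the command name to end where the match ends.
_PATTERN = re.compile(
    "(?:"
    + "|".join(
        re.escape(cmd)
        for cmd in sorted(LATEX_TO_UNICODE, key=len, reverse=True)
    )
    + ")(?![A-Za-z])"
)


def latex_to_unicode(text):
    """Replace the LaTeX commands listed in the table by Unicode characters."""
    return _PATTERN.sub(lambda m: LATEX_TO_UNICODE[m.group(0)], text)


print(latex_to_unicode(r"V \otimes W"))
```

Commands that are not in the table are left untouched, so the output degrades to the original LaTeX rather than to something garbled.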

Resolves #35491

📝 Checklist

The title is concise, informative, and self-explanatory.
The description explains in detail what this PR is about.
I have linked a relevant issue or discussion.
I have created tests covering the changes.
I have updated the documentation accordingly.

⌛ Dependencies

tscrim · 2023-04-13T07:35:34Z

I agree with this, but I forget what our policy is on assuming that users have a terminal that supports unicode.

Also, what about the other (more common) Greek letters? It would look quite weird to only have some of them changed to unicode.

tobiasdiez · 2023-04-13T09:33:14Z

At JabRef we have had good experiences with the small latex2unicode library. It's written in Scala, so it is not reusable here, but the encoding map is rather complete: https://github.com/tomtung/latex2unicode/blob/master/src/main/scala/com/github/tomtung/latex2unicode/helper/Escape.scala

mkoeppe · 2023-04-13T16:23:06Z

> I forget what our policy is on assuming that users have a terminal that supports unicode.

I don't think we have a policy for this formulated anywhere. The sage.tensor and sage.manifolds packages already use Unicode characters, and I don't think we have heard complaints about that.

I wouldn't be concerned about terminal capabilities. I'd think the most plausible compatibility issue would be (1) workflows in which users copy-paste terminal output into a LaTeX document, without loading the necessary packages for input and font configuration; or (2) programs that run Sage as a subprocess.
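
If a safeguard for case (2) were wanted, one option would be to check whether the output stream can encode the Unicode version and fall back to the LaTeX source otherwise. The following is only a sketch of that idea; `can_encode` and `choose_rendering` are hypothetical helpers, not existing Sage functions.

```python
import io
import sys


def can_encode(text, encoding):
    """Return True if ``text`` can be encoded with ``encoding``."""
    try:
        text.encode(encoding)
    except (UnicodeEncodeError, LookupError):
        return False
    return True


def choose_rendering(unicode_text, latex_text, stream=None):
    """Pick the Unicode rendering only if ``stream`` can represent it."""
    stream = sys.stdout if stream is None else stream
    # Streams without an encoding attribute are treated as ASCII-only.
    encoding = getattr(stream, "encoding", None) or "ascii"
    return unicode_text if can_encode(unicode_text, encoding) else latex_text


# Two in-memory streams standing in for a pipe with a given encoding.
ascii_stream = io.TextIOWrapper(io.BytesIO(), encoding="ascii")
utf8_stream = io.TextIOWrapper(io.BytesIO(), encoding="utf-8")

print(choose_rendering("V ⊗ W", r"V \otimes W", utf8_stream))
print(choose_rendering("V ⊗ W", r"V \otimes W", ascii_stream))
```

This does nothing for case (1), since a copy-paste workflow cannot be detected from inside the process.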

mkoeppe · 2023-04-13T16:25:16Z

> At JabRef we have had good experiences with the small latex2unicode library. It's written in Scala, so it is not reusable here, but the encoding map is rather complete: https://github.com/tomtung/latex2unicode/blob/master/src/main/scala/com/github/tomtung/latex2unicode/helper/Escape.scala

Thanks for the pointer! Looking great. I think somewhere in IPython/Jupyter there also must be a package that contains such mappings already, for offering tab-completion with the latex names.

mkoeppe · 2023-04-13T16:30:57Z

> what about the other (more common) Greek letters? It would look quite weird to only have some of them changed to unicode.

Yes, it's not complete, of course; this is just a mockup that makes CliffordAlgebra?, ExteriorAlgebra? and some tensor docstrings look good.

Also, because the substitutions are regex-based, there is a potential for breakage from unintended matches. For example, so far I have shied away from handling the \ (explicit space) command. So I think we need some kind of systematic testing to avoid unwelcome surprises.
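
To make the concern concrete, here is a small sketch of one kind of unintended match (a short command that is a prefix of a longer one) together with a check that could be part of systematic testing. The helper `find_prefix_collisions` and the sample command lists are hypothetical and not part of the PR.

```python
def find_prefix_collisions(mapped, known_commands):
    """Return {mapped command: [longer known commands it is a prefix of]}."""
    collisions = {}
    for short in mapped:
        hits = sorted(
            c for c in known_commands if c != short and c.startswith(short)
        )
        if hits:
            collisions[short] = hits
    return collisions


# A naive textual replacement of \in also hits the start of \infty.
naive = r"\infty".replace(r"\in", "∈")
print(naive)

# Sample data, for illustration only.
mapped = [r"\in", r"\le", r"\otimes"]
known = [r"\in", r"\infty", r"\int", r"\le", r"\left", r"\leq", r"\otimes"]
collisions = find_prefix_collisions(mapped, known)
print(collisions)
```

A check like this only covers prefix collisions; commands such as the explicit space would still need tests against real docstrings.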

mkoeppe · 2023-04-13T16:33:36Z

And finally, it seems to me that this is something that should be taken care of in a more reusable way, perhaps a Sphinx extension. Mildly related:

Replace sage.misc.sphinxify with docrepr #33682

tscrim · 2023-04-14T07:35:18Z

> > what about the other (more common) Greek letters? It would look quite weird to only have some of them changed to unicode.

> Yes, it's not complete, of course; this is just a mockup that makes CliffordAlgebra?, ExteriorAlgebra? and some tensor docstrings look good.

The unfortunate side-effect is that it makes any doc that only does part of it look very broken. This is a good proof-of-concept right now, but I think it would be hard to convince people that our doc formatter is not horribly broken without doing certain subsets (e.g., all Greek letters).
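
One way to cover a whole subset such as the Greek letters without typing each character by hand is to generate the table from Unicode character names. This is a sketch under assumptions: the names `LOWER`, `UPPER`, `greek_char` and `GREEK` are hypothetical, and the variant forms (`\varepsilon`, `\varphi`, and so on) are ignored, so `\epsilon` and `\phi` map to the plain Greek letters.

```python
import unicodedata

# LaTeX provides commands for these lowercase letters ...
LOWER = (
    "alpha beta gamma delta epsilon zeta eta theta iota kappa lambda "
    "mu nu xi pi rho sigma tau upsilon phi chi psi omega"
).split()
# ... and only for those capitals that differ from Latin letters.
UPPER = "Gamma Delta Theta Lambda Xi Pi Sigma Upsilon Phi Psi Omega".split()


def greek_char(name):
    """Look up the Greek letter for a LaTeX command name such as 'alpha'."""
    case = "CAPITAL" if name[0].isupper() else "SMALL"
    # The Unicode character name spells this letter "LAMDA".
    unicode_name = name.upper().replace("LAMBDA", "LAMDA")
    return unicodedata.lookup(f"GREEK {case} LETTER {unicode_name}")


GREEK = {"\\" + name: greek_char(name) for name in LOWER + UPPER}

print(GREEK[r"\alpha"], GREEK[r"\lambda"], GREEK[r"\Omega"])
```

Generating the table this way makes it harder to end up with only some letters of a family converted.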

> Also, because the substitutions are regex-based, there is a potential for breakage from unintended matches. For example, so far I have shied away from handling the \ (explicit space) command. So I think we need some kind of systematic testing to avoid unwelcome surprises.

Indeed, that might be a hard one to deal with. I think we would be better off replacing it in our docstrings with something like \quad that is easier to identify (or at least far less likely to be misidentified).

jhpalmieri · 2023-07-27T20:07:49Z

> > At JabRef we have had good experiences with the small latex2unicode library. It's written in Scala, so it is not reusable here, but the encoding map is rather complete: https://github.com/tomtung/latex2unicode/blob/master/src/main/scala/com/github/tomtung/latex2unicode/helper/Escape.scala

> Thanks for the pointer! Looking great. I think somewhere in IPython/Jupyter there also must be a package that contains such mappings already, for offering tab-completion with the latex names.

I found these but haven't looked at them in any detail:

https://pypi.org/project/latexcodec/
https://pypi.org/project/pylatexenc/ (which took code from the previous item)
https://github.com/mennucc/unicode2latex/blob/main/latex2unicode

github-actions · 2024-03-06T00:10:07Z

Documentation preview for this PR (built with commit e17b688; changes) is ready! 🎉

vbraun force-pushed the develop branch from 883e05f to e349b00 · November 12, 2023 16:25

tornaria mentioned this pull request Dec 10, 2023

Replace sage.misc.sphinxify with docrepr #33682

Open

src/sage/misc/sagedoc.py: Add latex to unicode mappings

e17b688

vbraun force-pushed the develop branch from eba5e19 to e5f42fa · June 3, 2024 22:15
