Skip to content

Add speech rules for black circled Latin letters (U+1F150–1F169)#524

Merged
NSoiffer merged 2 commits intodaisy:mainfrom
brichwin:add-black-circled-letters
Mar 27, 2026
Merged

Add speech rules for black circled Latin letters (U+1F150–1F169)#524
NSoiffer merged 2 commits intodaisy:mainfrom
brichwin:add-black-circled-letters

Conversation

@brichwin
Copy link
Copy Markdown
Contributor

Faculty use circled capital letters (U+24B6–U+24CF) to create bubble-sheet like multiple-choice layouts for Gradescope-graded exams in LaTeX. The filled/black circled variants (U+1F150–1F169) are used in solution files to indicate the correct answer.

MathCAT already handled the open circled letters but was missing the black circled versions, leaving the correct-answer indicators unspoken.

This PR adds entries for U+1F150–1F169 to the following locale rule files, following the existing patterns in each:
de, en, es, fi, fr, nb, sv, vi, zh-tw

Translations were made by examining the other black circled character translations and then verifying with translate.google.com by testing speech text from:

<math xmlns="http://www.w3.org/1998/Math/MathML" display="block">
  <mtext>Ⓐ</mtext>
  <mtext>🅐</mtext>
</math>

@moritz-gross
Copy link
Copy Markdown
Collaborator

moritz-gross commented Mar 22, 2026

The auto-translation for German is wrong for "circled".
It should be the non-inflected form "eingekreist", as directly above.

I tried out the prompt

"check the translations in this PR for me: https://github.com/daisy/MathCAT/pull/524/changes"

in ChatGPT, and interestingly, it only found the same single error I also found, though my explanation on why its wrong was by far not as grammatically sound (though I learned "inflected form" in English hehe)

@brichwin
Copy link
Copy Markdown
Contributor Author

@moritz-gross Thanks!

@moritz-gross
Copy link
Copy Markdown
Collaborator

@NSoiffer I think this one is ready.
Technically I'm allowed to merge, but for the parts outside of PythonScripts I'd rather have you have the final look.

@NSoiffer
Copy link
Copy Markdown
Collaborator

Great thanks for the PR and the correction to it. I'll merge it, but for completeness, it would be good to extend this to the circled numbers also. Any takers?

AI says these are:

  1. Enclosed Alphanumerics
    This is the primary block for circled characters. It starts with the digit 1 (the circled zero is located later in the block).

    First Circled Number (1): U+2460 (①)

    Note: Circled zero (⓪) is at U+24EA.

  2. Dingbats
    This block contains decorative, heavy, and serif-style circled numbers.

    First Circled Number (1): U+2776 (❶) — Black Circled Digit One

    Note: The serif version (➀) starts at U+277F.

  3. Enclosed Alphanumeric Supplement
    This block contains circled numbers from 21 through 50.

    First Circled Number (21): U+1F151 (㉑)

  4. CJK Symbols and Punctuation
    This block contains circled numbers specifically designed to match East Asian typography (often used for dates or list markers).

    First Circled Number (1): U+303D (〽) is a separator, but the circled numbers 1-10 start at U+3220 (㈠).

    Note: These are technically "parenthesized" or "circled" Ideographs/Digits.

@NSoiffer NSoiffer merged commit ef975fd into daisy:main Mar 27, 2026
5 checks passed
@moritz-gross moritz-gross mentioned this pull request Mar 27, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants