Sound Change Appliers

? Morrígan Witch Queen of New York
posts: 303
, Marquise message
What format are they in? You may not really need to do very much.

I haven't done it yet, but I really need to make it so that we can import tables....
? Nessari ?????? ?????? ????????
posts: 932
, Illúbequía, Seattle, Cascadia
message
[Image: DRtnAmB.png]
14-16 pages of that. It takes a while to pull things out.

(Yes, that's a Rarity UI with Flutterbat as desktop background.)

edit: …shit. How did I forget to fix that? *grumbles*
? Morrígan Witch Queen of New York
posts: 303
, Marquise message
Are those tables inside a Writer file? Because there should be a way to throw that into a spreadsheet.
? Nessari ?????? ?????? ????????
posts: 932
, Illúbequía, Seattle, Cascadia
message
Not that I've looked particularly hard, but I have yet to find a non-manual one.
? Jipí der saz ûf eime steine
posts: 291
, Transition Metal on exhibit in Victoria, Canada
message
Table → Convert → Text to table → copypasta → Calc
? Morrígan Witch Queen of New York
posts: 303
, Marquise message
You might be able to use fixed-width or something, or use a text editor with column selection. I could give it a shot for you and see if there's anything semi-reasonable we can do to clean it up.
? masako posts: 206
, Conversational Speaker message
quoting Jipí, Alkali Metal on exhibit in Victoria, Canada:
Table → Convert → Text to table → copypasta → Calc

+1 Confirmed.
? Nessari ?????? ?????? ????????
posts: 932
, Illúbequía, Seattle, Cascadia
message
Still going to take a lot of hand fixing. I've actually been working on that, it's just going to take a long while. There's multiple, multiple different numbers of tabs on various lines, sometimes multiple lines, so the table's a freaking mess.
? Uzhdarchios posts: 19
, Foreigner message
quoting Nessari:
There's multiple, multiple different numbers of tabs on various lines, sometimes multiple lines, so the table's a freaking mess.

Before you convert to a table: Ctrl+h, click the plus sign next to “Other options”, check “Regular expressions”, search for \t\t and replace with \t. Then repeat until all tabs are single tabs.
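
For anyone who would rather script that step, here is a minimal Python sketch of the same idea. It assumes the text has been saved or pasted as plain text first; `collapse_tabs` is just an illustrative name. Matching `\t+` does in one pass what repeating the `\t\t` → `\t` replacement does by hand. Note that it also collapses intentionally empty cells, exactly like the manual version.

```python
import re

def collapse_tabs(text):
    """Replace every run of consecutive tabs with a single tab."""
    return re.sub(r"\t+", "\t", text)

# Example: a line with uneven tab runs between its columns.
line = "stem\t\t\tgloss\tnotes"
cleaned = collapse_tabs(line)  # "stem\tgloss\tnotes"
```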
? Herra Ratatoskr posts: 7
, Foreigner message
Hey, I'm having some weird outputs from HTS, and I was hoping you could explain why it's happening, so I can figure out how to avoid these sorts of problems. Here's the rules and variables in question:

Æ  = æː  æ  æ̀ː  æ̀  æːˈ  æˈ
EA  = æːa æa  æ̀ːa æ̀ a æːˈa æˈa

EA > Æ

If it's relevant, there are also variables E and A, which follow the same schema as EA and Æ, except they have "e" and "a" in place of "æ(a)".

It is changing all the combinations with æ that I expect it to. However, [a] is becoming [æːˈ], which it shouldn't do. For instance, "lang" is becoming "læːˈng" and "manij" is becoming "mæːˈnij". Am I missing something obvious here?
? Hallow XIII Primordial Crab
posts: 539
, 侯 of Kentucky
message
the space.

{æ̀ a} <<<

in other words, a (EA_5) becomes æːˈ (Æ_5)
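
A rough Python sketch of why that happens, assuming variables are split on whitespace and matched up by position. This is not HTS's actual code: the names are made up, and dropping the leftover seventh EA item is only what `zip` happens to do here.

```python
import re

# Variable definitions as posted, including the stray space in EA.
AE_DEF = "æː  æ  æ̀ː  æ̀  æːˈ  æˈ"
EA_DEF = "æːa æa  æ̀ːa æ̀ a æːˈa æˈa"

targets = AE_DEF.split()  # 6 items
sources = EA_DEF.split()  # 7 items: the stray space makes "a" its own item

# Pair items by position; zip stops at the shorter list (an assumption here).
mapping = dict(zip(sources, targets))

# Try longer sources first so a short item never pre-empts a longer one.
pattern = re.compile(
    "|".join(re.escape(s) for s in sorted(sources, key=len, reverse=True))
)

def apply_rule(word):
    """Replace each matched EA item with the Æ item at the same position."""
    return pattern.sub(lambda m: mapping.get(m.group(0), m.group(0)), word)

print(sources.index("a") + 1)  # 1-based position of the accidental "a" item
print(apply_rule("lang"))
```

Removing the space inside the fourth EA item brings the list back to six entries, and plain "a" is no longer a source at all.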
? Herra Ratatoskr posts: 7
, Foreigner message
D'OH! >.< Thanks, Hallow XIII. Not sure how that space got in there.
? Morrígan Witch Queen of New York
posts: 303
, Marquise message
0.1.0 Release!
I made an official release with a binary and everything!

This version supports scripting, tab-delimited files, and conditions with `NOT` statements.


https://github.com/sfmorrigan/toolbox-sca/releases/tag/v0.1.0
? Morrígan Witch Queen of New York
posts: 303
, Marquise message
I made my first Devblog post, summarizing project goals and future directions. Of course, I've probably discussed all of these issues here.
? Pthagnar Benedictine Ovulation
posts: 209
, Quaestor, Foraminis Aspirationis
message
you've missed an i out of 'pernicious' on https://github.com/sfmorrigan/toolbox-sca/blob/master/README.md
? Morrígan Witch Queen of New York
posts: 303
, Marquise message
Thanks. I decided to rewrite that clause entirely; "pernicious" was really not a great word choice.
? Morrígan Witch Queen of New York
posts: 303
, Marquise message
So, I changed the readme on the site and re-set it in LaTeX.
SCA Manual PDF

I've been working a lot on cleaning up the code involved in using the feature models so that they won't suck. Also I have a website.
? Morrígan Witch Queen of New York
posts: 303
, Marquise message
I have a serious question concerning how the SCA is started and runs:
Basic - user provides input, output, and rule file at startup
Enhanced - user only provides rule file; input and outputs are specified within the rules

The current situation leaves us with an uncomfortable state of affairs, for technical reasons. The problem is that in "basic" mode, the whole rule file is parsed before the lexicon is loaded, so if the user messes about with normalization or segmentation flags and is not very careful, the lexicon can be loaded using the wrong settings.

I'm wondering what all of you might prefer I do. I could remove basic mode entirely, since personally I feel like it's a lot easier to just use load statements than to have to modify a batch or shell file. Alternatively, I could change basic mode to restrict the ability to change segmentation and normalization settings.

At the very least, I think I will reduce the number of segmentation/normalization options, mostly because the majority are not necessary. When using intelligent segmentation (the new manual will have an appendix on this), it's best to use Canonical Decomposition[1]. When using no segmentation, there is really no reason to use normalization at all; it's best to leave all inputs just as they are.
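
To illustrate why decomposition pairs well with segmentation, here is a small Python sketch using the standard `unicodedata` module. It is only a toy: the `segment` function is hypothetical and is not the SCA's segmentation algorithm. It simply attaches combining marks to the preceding base character after normalizing to NFD, so precomposed and decomposed inputs come out as the same segments.

```python
import unicodedata

def segment(word):
    """Normalize to NFD, then group each base character with its combining marks."""
    decomposed = unicodedata.normalize("NFD", word)
    segments = []
    for ch in decomposed:
        # combining() returns a non-zero class for combining marks.
        if segments and unicodedata.combining(ch):
            segments[-1] += ch
        else:
            segments.append(ch)
    return segments

# The same word typed with a precomposed letter and with a combining accent.
precomposed = "\u00e9t\u00e9"
decomposed = "e\u0301te\u0301"
same = segment(precomposed) == segment(decomposed)
```

Spacing modifiers such as the length mark ː are not combining characters, so a real segmenter needs extra rules for them; this sketch leaves them as separate segments.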

If I make these changes, the number of options (and the potential for difficulty) is reduced, and it becomes much easier to set them with a flag on the command line. This seems like the best compromise and is what I will work toward doing soon.

Other news: I refactored the state-machine code and should be able to introduce negative conditions soon. At the very least, one will be able to negate single characters, if not entire groups.

Additionally, I'm fairly close to changing the SCA to optionally use features. Unlike IPA Zounds, you will be able to mix literals like a and features like [-voiced;+continuant], since the former can simply be converted to the latter internally.
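
As a rough illustration of the idea (a Python sketch, not the SCA's implementation), a literal can be looked up in a feature table and then compared exactly like a bracketed feature specification. The four-symbol table and the function names below are made up for the example.

```python
# Illustrative feature table; a real model would cover far more symbols and features.
FEATURES = {
    "t": {"voiced": False, "continuant": False},
    "d": {"voiced": True, "continuant": False},
    "s": {"voiced": False, "continuant": True},
    "z": {"voiced": True, "continuant": True},
}

def parse_spec(spec):
    """Turn '[-voiced;+continuant]' into {'voiced': False, 'continuant': True}."""
    body = spec.strip()[1:-1]
    items = [part.strip() for part in body.split(";") if part.strip()]
    return {item[1:]: item[0] == "+" for item in items}

def to_features(token):
    """Convert either a literal symbol or a bracketed spec into a feature dict."""
    if token.startswith("["):
        return parse_spec(token)
    return FEATURES[token]

def matches(symbol, token):
    """True if the symbol has every feature value the token asks for."""
    wanted = to_features(token)
    actual = FEATURES[symbol]
    return all(actual.get(name) == value for name, value in wanted.items())
```

With this table, `matches("s", "[-voiced;+continuant]")` and `matches("s", "s")` both go through the same comparison, which is what makes mixing the two notations possible.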

_________________________
1. Canonical Composition works too, but there's really no reason to use it.
? Morrígan Witch Queen of New York
posts: 303
, Marquise message
Addendum: Future Work
I've also been thinking about building a UI - I realize I've said in the past that I didn't see a point. Certainly for the purposes of running rule files, a UI is useless at best, and a hindrance at worst.

However, there is substantial value in building what is essentially an IDE. Even something as simple as editing a rules file with syntax highlighting[1] and executing the file would be a good use of a UI. But there is a lot of potential value in building this type of platform, like:
Providing error highlighting
View lexicons loaded by the rule file
Debug rules: step through rules and see how they change your lexicon, including "before" and "after" forms highlighting the differences
Mouse-over variable in rule to see what it contains at that point in the rule.
Phonetic feature editor
Mouse-over symbol to see feature values
Dependency diagram to show feed and bleed orders of rules
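
As a sketch of what the "step through rules" item could look like underneath, here is a small Python example. It uses plain regular-expression substitutions as stand-ins for real SCA rules, and the `trace` function is hypothetical; it records a before/after pair for every word a rule actually changes.

```python
import re

def trace(rules, lexicon):
    """Apply rules in order; return the final words and a log of each change."""
    words = list(lexicon)
    steps = []
    for pattern, replacement in rules:
        for i, before in enumerate(words):
            after = re.sub(pattern, replacement, before)
            if after != before:
                steps.append(((pattern, replacement), before, after))
                words[i] = after
    return words, steps

# Two toy rules: p > f, then word-final f is deleted.
rules = [("p", "f"), ("f$", "")]
final_words, steps = trace(rules, ["pap", "tak"])
for rule, before, after in steps:
    print(rule, before, "->", after)
```

In this toy run the first rule feeds the second, which is the kind of ordering relationship a dependency diagram would make visible.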

I'm looking into using JavaFX for this, but admittedly have only ever developed one relatively simple MV*-like application using jQuery and Handlebars. I suspect supporting the kinds of functionality I'd like to see would require a huge amount of work, and might detract from my ability to continue some of my research efforts, but it would be fantastic to have a platform with these capabilities.

It will mean cleaning up the API more, but I've been working on that and making good progress.

_________________________
1. where the editor knows what symbols are defined as variables, for instance
? Morrígan Witch Queen of New York
posts: 303
, Marquise message
v0.5.0 Release
Guess what I finally did this morning

https://github.com/samanthamccabe/toolbox-sca/releases/tag/v0.5.0

Quick rundown:
I fixed a lot of minor bugs. Created shell scripts / batch files to make running it easier. Changed how the program is invoked slightly. Wrote a 12-page manual and provided example files. Rewrote the regular expression engine and changed how normalization works (also, it's turned off by default).

If anyone wants it, I also made a syntax highlighter for Sublime Text 3:
http://haedus.org/releases/ToolboxSCA.tmLanguage