Thomas Grafhttps://thomasgraf.net/2020-01-04T00:00:00-05:00Computational Linguist at Stony Brook UniversityCurbing Feature Coding: Strictly Local Feature Assignment2020-01-04T00:00:00-05:002020-01-04T00:00:00-05:00Thomas Graftag:thomasgraf.net,2020-01-04:/output/graf20sciltalk.html<div class="authors">
<p>Graf, Thomas</p>
</div>
<div class="abstract">
<p><span id="abstract-title">Abstract</span>
Graf (2017) warns that every syntactic formalism faces a severe overgeneration problem because of the hidden power of subcategorization.
Any constraint definable in monadic second-order logic can be compiled into the category system so that it is indirectly enforced as part of …</p></div><div class="authors">
<p>Graf, Thomas</p>
</div>
<div class="abstract">
<p><span id="abstract-title">Abstract</span>
Graf (2017) warns that every syntactic formalism faces a severe overgeneration problem because of the hidden power of subcategorization.
Any constraint definable in monadic second-order logic can be compiled into the category system so that it is indirectly enforced as part of subcategorization.
Not only does this kind of feature coding deprive syntactic proposals of their empirical bite, it also undermines computational efforts to limit syntactic formalisms via subregular complexity.
This paper presents a subregular solution to feature coding.
Instead of features being a cheap resource that comes for free, features must be assigned by a transduction.
In particular, category features must be assigned by an input strictly local (<span class="caps">ISL</span>) tree-to-tree transduction, defined here for the first time.
The restriction to <span class="caps">ISL</span> transductions correctly rules out various deviant category systems.</p>
</div>
<div class="files">
<p><span id="files-title">Files</span>
[<a href="https://thomasgraf.net/doc/talks/Graf20SCiLtalk.pdf">pdf</a>]</p>
</div>
<div class="highlight"><pre><span></span><span class="nc">@misc</span><span class="p">{</span><span class="nl">Graf20SCiLtalk</span><span class="p">,</span>
<span class="na">author</span> <span class="p">=</span> <span class="s">{Graf, Thomas}</span><span class="p">,</span>
<span class="na">title</span> <span class="p">=</span> <span class="s">{Curbing Feature Coding: Strictly Local Feature Assignment}</span><span class="p">,</span>
<span class="na">year</span> <span class="p">=</span> <span class="s">{2020}</span><span class="p">,</span>
<span class="na">note</span> <span class="p">=</span> <span class="s">{Slides of a talk given at \emph{{SCiL} 2020}, January 2--5, co-located with the {LSA 2020}}</span>
<span class="p">}</span>
</pre></div>A Subregular Bound on the Complexity of Lexical Quantifiers2019-12-18T00:00:00-05:002019-12-18T00:00:00-05:00Thomas Graftag:thomasgraf.net,2019-12-18:/output/graf19actalk.html<div class="authors">
<p>Graf, Thomas</p>
</div>
<div class="abstract">
<p><span id="abstract-title">Abstract</span>
Semantic automata theory studies the complexity of generalized quantifiers in terms of the string languages that describe their truth conditions.
An important point has gone unnoticed so far: for most quantifiers that are determiners, these string languages are <em>subregular</em>.
Whereas quantifier phrases …</p></div><div class="authors">
<p>Graf, Thomas</p>
</div>
<div class="abstract">
<p><span id="abstract-title">Abstract</span>
Semantic automata theory studies the complexity of generalized quantifiers in terms of the string languages that describe their truth conditions.
An important point has gone unnoticed so far: for most quantifiers that are determiners, these string languages are <em>subregular</em>.
Whereas quantifier phrases such as <em>an even number of</em> correspond to specific regular languages, <em>every</em>, <em>no</em>, <em>some</em> and <em>not all</em> stay within the much weaker class of <em>tier-based strictly local</em> languages (<span class="caps">TSL</span>).
In addition, it seems to hold crosslinguistically that a <span class="caps">TSL</span> quantifier may be expressed by a single lexical item only if its tier projection is monotonic.
This constitutes a novel typological universal that limits how complex a meaning may be stored in a single lexical item.
<span class="caps">TSL</span> is also known to play a central role in phonology, morphology, and syntax, which suggests that subregularity in general and tier-based strict locality in particular may be a computational universal of natural language that surfaces across all its submodules, including semantics.</p>
</div>
<div class="files">
<p><span id="files-title">Files</span>
[<a href="https://thomasgraf.net/doc/talks/Graf19ACtalk.pdf">pdf</a>]</p>
</div>
<div class="highlight"><pre><span></span><span class="nc">@misc</span><span class="p">{</span><span class="nl">Graf19ACtalk</span><span class="p">,</span>
<span class="na">author</span><span class="p">=</span><span class="s">{Graf, Thomas}</span><span class="p">,</span>
<span class="na">title</span><span class="p">=</span><span class="s">{A Subregular Bound on the Complexity of Lexical Quantifiers}</span><span class="p">,</span>
<span class="na">year</span> <span class="p">=</span> <span class="s">{2019}</span><span class="p">,</span>
<span class="na">note</span> <span class="p">=</span> <span class="s">{Slides of a talk given at the Amsterdam Colloquium, {D}ecember 18--20, University of Amsterdam, Amsterdam, Netherlands}</span><span class="p">,</span>
<span class="p">}</span>
</pre></div>A Subregular Bound on the Complexity of Lexical Quantifiers2019-12-15T00:00:00-05:002019-12-15T00:00:00-05:00Thomas Graftag:thomasgraf.net,2019-12-15:/output/graf19ac.html<div class="authors">
<p>Graf, Thomas</p>
</div>
<div class="abstract">
<p><span id="abstract-title">Abstract</span>
Semantic automata theory studies the complexity of generalized quantifiers in terms of the string languages that describe their truth conditions.
An important point has gone unnoticed so far: for most quantifiers that are determiners, these string languages are <em>subregular</em>.
Whereas quantifier phrases …</p></div><div class="authors">
<p>Graf, Thomas</p>
</div>
<div class="abstract">
<p><span id="abstract-title">Abstract</span>
Semantic automata theory studies the complexity of generalized quantifiers in terms of the string languages that describe their truth conditions.
An important point has gone unnoticed so far: for most quantifiers that are determiners, these string languages are <em>subregular</em>.
Whereas quantifier phrases such as <em>an even number of</em> correspond to specific regular languages, <em>every</em>, <em>no</em>, <em>some</em> and <em>not all</em> stay within the much weaker class of <em>tier-based strictly local</em> languages (<span class="caps">TSL</span>).
In addition, it seems to hold crosslinguistically that a <span class="caps">TSL</span> quantifier may be expressed by a single lexical item only if its tier projection is monotonic.
This constitutes a novel typological universal that limits how complex a meaning may be stored in a single lexical item.
<span class="caps">TSL</span> is also known to play a central role in phonology, morphology, and syntax, which suggests that subregularity in general and tier-based strict locality in particular may be a computational universal of natural language that surfaces across all its submodules, including semantics.</p>
</div>
<div class="files">
<p><span id="files-title">Files</span>
[<a href="https://thomasgraf.net/doc/papers/Graf19AC.pdf">pdf</a>]</p>
</div>
<div class="highlight"><pre><span></span><span class="nc">@inproceedings</span><span class="p">{</span><span class="nl">Graf19AC</span><span class="p">,</span>
<span class="na">author</span> <span class="p">=</span> <span class="s">{Graf, Thomas}</span><span class="p">,</span>
<span class="na">title</span> <span class="p">=</span> <span class="s">{A Subregular Bound on the Complexity of Lexical Quantifiers}</span><span class="p">,</span>
<span class="na">year</span> <span class="p">=</span> <span class="s">{2019}</span><span class="p">,</span>
<span class="na">booktitle</span> <span class="p">=</span> <span class="s">{Proceedings of the 22nd {A}msterdam {C}olloquium}</span><span class="p">,</span>
<span class="na">editor</span> <span class="p">=</span> <span class="s">{Schl\"{o}der, Julian J. and McHugh, Dean and Roelofsen, Floris}</span><span class="p">,</span>
<span class="na">pages</span> <span class="p">=</span> <span class="s">{455--464}</span><span class="p">,</span>
<span class="p">}</span>
</pre></div>Subregular Linguistics for Linguists2019-11-08T00:00:00-05:002019-11-08T00:00:00-05:00Thomas Graftag:thomasgraf.net,2019-11-08:/output/graf19umasstalk.html<div class="authors">
<p>Graf, Thomas</p>
</div>
<div class="abstract">
<p><span id="abstract-title">Abstract</span>
Drawing from computational work that is known as the <strong>subregular program</strong>, I argue against two received views in linguistics: “phonology and syntax are very different’ and “subcategorization is a solved problem”.</p>
<ol>
<li>
<p><strong>Cognitive parallelism</strong><br>
Subregular notions of complexity can be applied to strings …</p></li></ol></div><div class="authors">
<p>Graf, Thomas</p>
</div>
<div class="abstract">
<p><span id="abstract-title">Abstract</span>
Drawing from computational work that is known as the <strong>subregular program</strong>, I argue against two received views in linguistics: “phonology and syntax are very different’ and “subcategorization is a solved problem”.</p>
<ol>
<li>
<p><strong>Cognitive parallelism</strong><br>
Subregular notions of complexity can be applied to strings as well as trees.
Doing so reveals that phonology and syntax are remarkably similar (and those parallels even extend into morphology and semantics).
For instance, islands and blocking effects are instances of the same computational mechanism.</p>
</li>
<li>
<p><strong>Subcategorization</strong><br>
Subcategorization (or c-selection) is rarely studied by linguists, but it is actually a source of tremendous overgeneration.
Once again subregular notions of complexity can be used to address this problem.
This isn’t just a mathematical exercise, but makes concrete empirical predictions about the nature of category systems, subcategorization, the status of empty heads, the <span class="caps">DP</span>-analysis, <span class="caps">DM</span>-style roots, and once again highlights parallels to phonology.</p>
</li>
</ol>
<p>The general upshot is that subregular concepts, despite their computational origin, are intuitive and linguistically fertile: they address conceptual issues, bridge gaps between linguistic subfields, and make concrete empirical predictions.
Subregular linguistics is just linguistics with some computational flavor sprinkled on top.</p>
<p><em>Disclaimer:</em> This talk is 100% formula-free.</p>
</div>
<div class="files">
<p><span id="files-title">Files</span>
[<a href="https://thomasgraf.net/doc/talks/Graf19UMasstalk.pdf">pdf</a>]</p>
</div>
<div class="highlight"><pre><span></span><span class="nc">@misc</span><span class="p">{</span><span class="nl">Graf19UMasstalk</span><span class="p">,</span>
<span class="na">author</span> <span class="p">=</span> <span class="s">{Graf, Thomas}</span><span class="p">,</span>
<span class="na">title</span> <span class="p">=</span> <span class="s">{Subregular Linguistics for Linguists}</span><span class="p">,</span>
<span class="na">year</span> <span class="p">=</span> <span class="s">{2019}</span><span class="p">,</span>
<span class="na">note</span> <span class="p">=</span> <span class="s">{Invited talk, November 8, University of Massachusetts Amherst, Amherst, Massachusetts}</span>
<span class="p">}</span>
</pre></div>Features: More Trouble Than They’re Worth?2019-05-27T00:00:00-04:002019-05-27T00:00:00-04:00Thomas Graftag:thomasgraf.net,2019-05-27:/output/graf19tromsotalk.html<div class="authors">
<p>Graf, Thomas</p>
</div>
<div class="abstract">
<p><span id="abstract-title">Abstract</span>
Do contemporary syntactic theories rely on too many features?
Decades of computational research have culminated in two contradictory answers:</p>
<ul>
<li>Yes, there are too many features because having at least two features is already one too many and brings in undesirable overgeneration.</li>
<li>No …</li></ul></div><div class="authors">
<p>Graf, Thomas</p>
</div>
<div class="abstract">
<p><span id="abstract-title">Abstract</span>
Do contemporary syntactic theories rely on too many features?
Decades of computational research have culminated in two contradictory answers:</p>
<ul>
<li>Yes, there are too many features because having at least two features is already one too many and brings in undesirable overgeneration.</li>
<li>No, the number of features is irrelevant because a grammar with no features can be just as powerful as one with thousands of features.</li>
</ul>
<p>The source of this confusing state of affairs is the intimate relation between features and constraints.
Features can be replaced by constraints, to a point where the grammar is completely feature-free.
And constraints can be compiled down into the feature system until no constraints are left and everything is done by feature checking.</p>
<p><strong>Proposition 1</strong>: Features and constraints are interchangeable.</p>
<p>The power to express arbitrary constraints causes massive overgeneration, which must be curtailed.
There is a substantive body of computational work on how constraints can be limited in a linguistically natural fashion, whereas it is largely unclear how the same can be achieved with restrictions on feature systems.
I will sketch what options have been explored so far and why constraints are easier to rein in than features.</p>
<p><strong>Proposition 2</strong>: The power of constraints is easy to restrict, that of features is not.</p>
<p>In the absence of meaningful restrictions on features, then, constraint-based accounts should be preferred.
I present small case studies on successive-cyclic movement, selection, and morphosyntax that illustrate what this approach looks like in practice and how it can yield new theoretical and empirical insights.</p>
</div>
<div class="files">
<p><span id="files-title">Files</span>
[<a href="https://thomasgraf.net/doc/talks/Graf19Tromsotalk.pdf">pdf</a>]</p>
</div>
<div class="highlight"><pre><span></span><span class="nc">@misc</span><span class="p">{</span><span class="nl">Graf19Tromsotalk</span><span class="p">,</span>
<span class="na">author</span> <span class="p">=</span> <span class="s">{Graf, Thomas}</span><span class="p">,</span>
<span class="na">title</span> <span class="p">=</span> <span class="s">{Features: More Trouble Than They're Worth?}</span><span class="p">,</span>
<span class="na">year</span> <span class="p">=</span> <span class="s">{2019}</span><span class="p">,</span>
<span class="na">note</span> <span class="p">=</span> <span class="s">{Invited talk, May 27, Workshop on \emph{Thirty Million Theories of Syntactic Features}, University of Troms\o, Troms\o, Norway}</span>
<span class="p">}</span>
</pre></div>Diagnosing Movement via the Absence of c-Command Relations2019-03-23T00:00:00-04:002019-03-23T00:00:00-04:00Thomas Graftag:thomasgraf.net,2019-03-23:/output/graflaszakovits19plctalk.html<div class="authors">
<p>Graf, Thomas, and Sabine Laszakovits</p>
</div>
<div class="abstract">
<p><span id="abstract-title">Abstract</span>
Based on an earlier finding that almost all c-command dependencies amount to subregular constraints on c-strings, we argue that all exceptions to this generalization involve movement.
Since regulating movement is already known to be beyond the purview of c-strings …</p></div><div class="authors">
<p>Graf, Thomas, and Sabine Laszakovits</p>
</div>
<div class="abstract">
<p><span id="abstract-title">Abstract</span>
Based on an earlier finding that almost all c-command dependencies amount to subregular constraints on c-strings, we argue that all exceptions to this generalization involve movement.
Since regulating movement is already known to be beyond the purview of c-strings, it is only natural that phenomena that deeply interact with movement may require additional mechanisms beyond c-strings.</p>
</div>
<div class="files">
<p><span id="files-title">Files</span></p>
</div>
<div class="highlight"><pre><span></span><span class="nc">@misc</span><span class="p">{</span><span class="nl">GrafLaszakovits19PLCtalk</span><span class="p">,</span>
<span class="na">author</span> <span class="p">=</span> <span class="s">{Graf, Thomas and Laszakovits, Sabine}</span><span class="p">,</span>
<span class="na">title</span> <span class="p">=</span> <span class="s">{Diagnosing Movement via the Absence of c-Command Relations}</span><span class="p">,</span>
<span class="na">year</span> <span class="p">=</span> <span class="s">{2019}</span><span class="p">,</span>
<span class="na">note</span> <span class="p">=</span> <span class="s">{Slides presented at the 43rd Penn Linguistics Colloquium (PLC43), March 22--24,</span>
<span class="s"> University of Pennsylvania, Philadelphia, PA}</span>
<span class="p">}</span>
</pre></div>C-Command Dependencies as TSL String Constraints2019-01-05T00:00:00-05:002019-01-05T00:00:00-05:00Thomas Graftag:thomasgraf.net,2019-01-05:/output/grafshafiei19scilposter.html<div class="authors">
<p>Graf, Thomas, and Nazila Shafiei</p>
</div>
<div class="abstract">
<p><span id="abstract-title">Abstract</span>
We provide a general formal framework for analyzing c-command based dependencies in syntax, e.g. binding and <span class="caps">NPI</span> licensing, from a subregular perspective.
C-command relations are represented as strings computed from Minimalist derivation trees, and syntactic dependencies are shown …</p></div><div class="authors">
<p>Graf, Thomas, and Nazila Shafiei</p>
</div>
<div class="abstract">
<p><span id="abstract-title">Abstract</span>
We provide a general formal framework for analyzing c-command based dependencies in syntax, e.g. binding and <span class="caps">NPI</span> licensing, from a subregular perspective.
C-command relations are represented as strings computed from Minimalist derivation trees, and syntactic dependencies are shown to be input-output tier-based strictly local over such strings.
The complexity of many syntactic phenomena thus is comparable to dependencies in phonology and morphology.</p>
</div>
<div class="files">
<p><span id="files-title">Files</span>
[<a href="https://thomasgraf.net/doc/talks/GrafShafiei19SCiLposter.pdf">pdf</a>]</p>
</div>
<div class="highlight"><pre><span></span><span class="nc">@misc</span><span class="p">{</span><span class="nl">GrafShafiei19SCiLposter</span><span class="p">,</span>
<span class="na">author</span> <span class="p">=</span> <span class="s">{Graf, Thomas and Shafiei, Nazila}</span><span class="p">,</span>
<span class="na">title</span> <span class="p">=</span> <span class="s">{C-Command Dependencies as {TSL} String Constraints}</span><span class="p">,</span>
<span class="na">year</span> <span class="p">=</span> <span class="s">{2019}</span><span class="p">,</span>
<span class="na">note</span> <span class="p">=</span> <span class="s">{Poster presented at \emph{{SCiL} 2019}, January 3--6, co-located with the {LSA 2019}}</span>
<span class="p">}</span>
</pre></div>The Computational Cost of Generalizations: An Example from Micromorphology2019-01-05T00:00:00-05:002019-01-05T00:00:00-05:00Thomas Graftag:thomasgraf.net,2019-01-05:/output/moradietal19scilposter.html<div class="authors">
<p>Moradi, Sedigheh, Alëna Aksënova, and Thomas Graf</p>
</div>
<div class="abstract">
<p><span id="abstract-title">Abstract</span>
The central concern of linguistics is to succinctly state generalizations.
But as numerous linguists have pointed out over the years, generalizations do not always come for free.
A formalism’s ability to account for the data does …</p></div><div class="authors">
<p>Moradi, Sedigheh, Alëna Aksënova, and Thomas Graf</p>
</div>
<div class="abstract">
<p><span id="abstract-title">Abstract</span>
The central concern of linguistics is to succinctly state generalizations.
But as numerous linguists have pointed out over the years, generalizations do not always come for free.
A formalism’s ability to account for the data does not entail that it can express the relevant generalizations about the data.
We illustrate this point with a computational case study in micro-morphology.
More precisely, we show that the subregular complexity of adjective inflection in Noon varies depending on what generalizations one wishes to capture.</p>
</div>
<div class="files">
<p><span id="files-title">Files</span>
[<a href="https://thomasgraf.net/doc/talks/MoradiEtAl19SCiLposter.pdf">pdf</a>]</p>
</div>
<div class="highlight"><pre><span></span><span class="nc">@misc</span><span class="p">{</span><span class="nl">MoradiEtAl19SCiLposter</span><span class="p">,</span>
<span class="na">author</span> <span class="p">=</span> <span class="s">{Moradi, Sedigheh and Aks\"{e}nova, Al\"{e}na and Graf, Thomas}</span><span class="p">,</span>
<span class="na">title</span> <span class="p">=</span> <span class="s">{The Computational Cost of Generalizations: An Example from Micromorphology}</span><span class="p">,</span>
<span class="na">year</span> <span class="p">=</span> <span class="s">{2019}</span><span class="p">,</span>
<span class="na">note</span> <span class="p">=</span> <span class="s">{Poster presented at \emph{{SCiL} 2019}, January 3--6, co-located with the {LSA 2019}}</span>
<span class="p">}</span>
</pre></div>