<?xml version='1.0' encoding='UTF-8'?><?xml-stylesheet href="http://www.blogger.com/styles/atom.css" type="text/css"?><feed xmlns='http://www.w3.org/2005/Atom' xmlns:openSearch='http://a9.com/-/spec/opensearchrss/1.0/' xmlns:georss='http://www.georss.org/georss' xmlns:gd='http://schemas.google.com/g/2005' xmlns:thr='http://purl.org/syndication/thread/1.0'><id>tag:blogger.com,1999:blog-8540876</id><updated>2011-12-05T17:21:05.206+02:00</updated><category term='Classes'/><category term='Publications'/><category term='Math'/><category term='Downloads'/><category term='Students'/><category term='Türkçe'/><category term='Projects'/><category term='Links'/><category term='Books'/><category term='Notes'/><title type='text'>Deniz Yuret's Homepage</title><subtitle type='html'></subtitle><link rel='http://schemas.google.com/g/2005#feed' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/posts/default'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default?max-results=100'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/'/><link rel='hub' href='http://pubsubhubbub.appspot.com/'/><link rel='next' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default?start-index=101&amp;max-results=100'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><generator version='7.00' uri='http://www.blogger.com'>Blogger</generator><openSearch:totalResults>174</openSearch:totalResults><openSearch:startIndex>1</openSearch:startIndex><openSearch:itemsPerPage>100</openSearch:itemsPerPage><entry><id>tag:blogger.com,1999:blog-8540876.post-8143818544357348053</id><published>2011-10-29T14:49:00.000+03:00</published><updated>2011-10-30T09:04:54.108+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Notes'/><title type='text'>Gamma distribution</title><content type='html'>The Gamma distribution is often used as a prior for positive random variables just like the Gaussian distribution for real valued random variables. &amp;nbsp;The purpose of this post is to build some intuition about how the two parameters, the shape parameter "a" and the scale parameter "b", effect the behavior of a Gamma random variable. &amp;nbsp;In particular we will show that for vague Gamma parameters (a&amp;lt;&amp;lt;1) the distribution almost acts like an upper bound on the random variable. &lt;span class="fullpost"&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Here is the Gamma PDF:&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;$f(x) = \frac{1}{\Gamma(a) b} (\frac{x}{b})^{a-1} e^{-x/b}&amp;nbsp;\;\; x\geq 0; a , b&amp;gt;0$&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;The mean is ab and the variance is ab&amp;sup2;. &amp;nbsp;When a=1 it is equivalent to the exponential distribution. &amp;nbsp;In fact when a is an integer, it is equivalent to the sum of (a) independent exponentially distributed random variables each of which has a mean of (b). &amp;nbsp;It is shaped like the exponential distribution with a spike at 0 for a&amp;lt;1, but has a mode at (a-1)b for a&amp;gt;1&amp;nbsp;(see the&amp;nbsp;&lt;a href="http://en.wikipedia.org/wiki/Gamma_distribution"&gt;Wikipedia article&lt;/a&gt;).&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;MacKay suggests representing the positive real variable x in terms of its logarithm z=ln x (&lt;a href="http://www.inference.phy.cam.ac.uk/mackay/itila"&gt;ITILA&lt;/a&gt;, pp. 314). &amp;nbsp;This will give us a better idea about the order of magnitude of typical x in terms of a and b. &amp;nbsp;The distribution in terms of z is:&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;$f(z) = \frac{1}{\Gamma(a)} (\frac{x}{b})^a e^{-x/b} &amp;nbsp;\;\; z \in \Re; x=e^z; a, b&amp;gt;0$&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;We can get an idea about the shape of f(z) by looking at its first two derivatives with respect to z:&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;$f'(z) = f(z) (a-\frac{x}{b})$&lt;/div&gt;&lt;div&gt;$f''(z) = f(z) (a^2 - (2a+1)\frac{x}{b} + (\frac{x}{b})^2)$&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://4.bp.blogspot.com/-qswDJ15-IRU/Tqu9or4YGUI/AAAAAAAAAfA/KCjP6-RPWRA/s1600/gamma.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="240" src="http://4.bp.blogspot.com/-qswDJ15-IRU/Tqu9or4YGUI/AAAAAAAAAfA/KCjP6-RPWRA/s320/gamma.png" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;The graph above shows f(z) and its two derivatives for a=1/10 and b=10. The first derivative tells us that f(z) has a single mode at x=ab. Note that x=ab is the mean of f(x) but only the mode (not the mean) of f(z). The curve raises slowly on the left of the mode and falls sharply on the right. The second derivative has two roots that give us the values with the minimum and the maximum slope:&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;$x = ab + \frac{b}{2} \pm \frac{b}{2} \sqrt{1+4a}$.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Now we are going to look at the limit where a&amp;lt;&amp;lt;1, typically used as a vague prior. The height of the mode at x=ab is a&lt;sup&gt;a&lt;/sup&gt;e&lt;sup&gt;-a&lt;/sup&gt;/&amp;Gamma;(a).  &amp;Gamma;(a) is well approximated by 1/a for small a, a&lt;sup&gt;a&lt;/sup&gt; and e&lt;sup&gt;-a&lt;/sup&gt; both go to 1, so f(z) &amp;asymp; a at the mode.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Next, let's look at the&amp;nbsp;right side (x&amp;gt;ab) where f(z) seems to fall sharply. &amp;nbsp;According to the roots of the second derivative given above, the minimum slope occurs at around x=b (if we ignore the terms with a&amp;lt;&amp;lt;1). &amp;nbsp;The value of f(z) when x=b is 1/(e &amp;Gamma;(a)). &amp;nbsp;&amp;Gamma;(a) is well approximated by 1/a for small a, so this value is approximately a/e. &amp;nbsp;The slope at x=b is approximately -a/e and if we fit a line at that point the line would cross 0 at x=eb.  Thus for small a, the probability can be considered negligible for x&amp;gt;eb.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Next, let's look at the left side (x &amp;lt; ab) where f(z) appears more flat. &amp;nbsp;The maximum slope occurs around x=a&amp;sup2;b (if we approximate &amp;radic; 1+4a with 1+2a-2a&amp;sup2;). &amp;nbsp;The slope at&amp;nbsp;x=a&amp;sup2;b is approximately a&amp;sup2; which gives a flat shape for x&amp;lt;ab when a&amp;lt;&amp;lt;1.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;In summary, when used with a&amp;lt;&amp;lt;1, f(z) rises slowly for x&amp;lt;ab (with approximate slope a&amp;sup2;) and falls sharply for x&amp;gt;ab (with approximate slope -a/e). &amp;nbsp;You are unlikely to see x values larger than eb from such a distribution, but you may see values much smaller than the mean ab. &amp;nbsp;Thus a vague Gamma prior is practically putting an upper bound on your positive value. &amp;nbsp;The figure below shows how the f(z) distribution starts looking like a step function as the shape parameter approaches 0 (b=1/a and the peak heights have been matched for comparison).&lt;/div&gt;&lt;br /&gt;&lt;div class="separator" style="clear: both; text-align: center;"&gt;&lt;a href="http://4.bp.blogspot.com/-C5j-_SXbjlU/Tqv3TbKDFOI/AAAAAAAAAfI/-pwbhjdudnc/s1600/gamma2.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="240" src="http://4.bp.blogspot.com/-C5j-_SXbjlU/Tqv3TbKDFOI/AAAAAAAAAfI/-pwbhjdudnc/s320/gamma2.png" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;/div&gt;&lt;div&gt;I should also note that in the limit where a&amp;rarr;0 and ab=1, we get an improper prior where f(z) becomes flat and the Gamma distribution becomes indifferent to the order of magnitude of the random variable.  However it flattens a lot faster on the left than on the right.&lt;/div&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-8143818544357348053?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/8143818544357348053/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=8143818544357348053' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/8143818544357348053'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/8143818544357348053'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2011/10/gamma-distribution.html' title='Gamma distribution'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://4.bp.blogspot.com/-qswDJ15-IRU/Tqu9or4YGUI/AAAAAAAAAfA/KCjP6-RPWRA/s72-c/gamma.png' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-8211879535732871290</id><published>2011-08-16T10:52:00.003+03:00</published><updated>2011-10-24T18:01:40.162+03:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Students'/><title type='text'>Ergun Biçici, Ph.D. 2011</title><content type='html'>&lt;b&gt;The Regression Model of Machine Translation&lt;/b&gt;&lt;br /&gt;Ergun Biçici.  Ph.D. Dissertation.  Koç University, Department of Computer Engineering.  August, 2011.  (&lt;a href="https://docs.google.com/viewer?a=v&amp;amp;pid=explorer&amp;amp;chrome=true&amp;amp;srcid=0B6C4-zOYlkxsNTE5Y2Y0NTItNWY1NC00MDY2LTg2NjYtNGI5MTI2NzcwYjA3&amp;amp;hl=en"&gt;PDF&lt;/a&gt;, &lt;a href="https://docs.google.com/viewer?a=v&amp;amp;pid=explorer&amp;amp;chrome=true&amp;amp;srcid=0B6C4-zOYlkxsMzRlMmUwOGEtODNiYy00MjA2LTk2ZTAtZWQwNTQ5YmUzNjMw&amp;amp;hl=en_US"&gt;Presentation&lt;/a&gt;)&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Abstract:&lt;/b&gt;&lt;br /&gt;Machine translation is the task of automatically finding the translation of a source sentence in the target language.  Statistical machine translation (SMT) use parallel corpora or bilingual paired corpora that are known to be translations of each other to find a likely translation for a given source sentence based on the observed translations.  The task of machine translation can be seen as an instance of estimating the functions that map strings to strings. &lt;span class="fullpost"&gt;&lt;br /&gt;&lt;br /&gt;Regression based machine translation (RegMT) approach provides a learning framework for machine translation, separating learning models for training, training instance selection, feature representation, and decoding. We use the transductive learning framework for making the RegMT approach computationally more scalable and consider the model building step independently for each test sentence.  We develop training instance selection algorithms that not only make RegMT computationally more scalable but also improve the performance of standard SMT systems. We develop better training instance selection techniques than previous work from given parallel training sentences for achieving more accurate RegMT models using less training instances.&lt;br /&gt;&lt;br /&gt;We introduce L_1 regularized regression as a better model than L_2 regularized regression for statistical machine translation.  Our results demonstrate that sparse regression models are better than L_2 regularized regression for statistical machine translation in predicting target features, estimating word alignments, creating phrase tables, and generating translation outputs.  We develop good evaluation techniques for measuring the performance of the RegMT model and the quality of the translations.  We use F_1 measure, which performs good when evaluating translations into English according to human judgments. F_1 allows us to evaluate the performance of the RegMT models using the target feature prediction vectors or the coefficients matrices learned or a given SMT model using its phrase table without performing the decoding step, which can be computationally expensive.&lt;br /&gt;&lt;br /&gt;Decoding is dependent on the representation of the training set and the features used.  We use graph decoding on the prediction vectors represented in n-gram or word sequence counts space found in the training set. We also decode using Moses after transforming the learned weight matrix representing the mappings between the source and target features to a phrase table that can be used by Moses during decoding.  We demonstrate that sparse L_1 regularized regression performs better than L_2 regularized regression in the German-English translation task and in the Spanish-English translation task when using small sized training sets. Graph based decoding can provide an alternative to phrase-based decoding in translation domains having low vocabulary.&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-8211879535732871290?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/8211879535732871290/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=8211879535732871290' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/8211879535732871290'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/8211879535732871290'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2011/08/ergun-bicici-phd-2011.html' title='Ergun Biçici, Ph.D. 2011'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-7256743956571709506</id><published>2011-07-30T23:09:00.001+03:00</published><updated>2011-08-14T23:52:15.118+03:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Publications'/><title type='text'>RegMT System for Machine Translation, System Combination, and Evaluation</title><content type='html'>Ergun Bicici; Deniz Yuret.  &lt;i&gt;Proceedings of the Sixth Workshop on Statistical Machine Translation.&lt;/i&gt;  pp. 323-329.  Edinburgh, Scotland.  July, 2011.  (&lt;a href="http://aclweb.org/anthology-new/W/W11/W11-2137.pdf"&gt;PDF&lt;/a&gt;, &lt;a href="http://aclweb.org/anthology-new/W/W11/W11-2137.bib"&gt;BIB&lt;/a&gt;, &lt;a href="http://aclweb.org/anthology-new/W/W11/#2100"&gt;Proceedings&lt;/a&gt;, &lt;a href="https://docs.google.com/viewer?a=v&amp;pid=explorer&amp;chrome=true&amp;srcid=0B6C4-zOYlkxsNzk2NmQ5MTctYjc0Yy00ZDAxLTkwNjktZTc1Y2YwOWFlZjhi&amp;hl=en_US"&gt;Poster&lt;/a&gt;)&lt;br /&gt;&lt;span class="fullpost"&gt;&lt;br /&gt;&lt;b&gt;Abstract:&lt;/b&gt; We present the results we obtain using our RegMT system, which uses transductive regression techniques to learn mappings between source and target features of given parallel corpora and use these mappings to generate machine translation outputs. Our training instance selection methods perform feature decay for proper selection of training instances, which plays an important role to learn correct feature mappings. RegMT uses L2 regularized regression as well as L1 regularized regression for sparse regression estimation of target features. We present translation results using our training instance selection methods, translation results using graph decoding, system combination results with RegMT, and performance evaluation with the&lt;br /&gt;F1 measure over target features as a metric for evaluating translation quality.&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-7256743956571709506?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='related' href='http://aclweb.org/anthology-new/W/W11/W11-2137.pdf' title='RegMT System for Machine Translation, System Combination, and Evaluation'/><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/7256743956571709506/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=7256743956571709506' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/7256743956571709506'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/7256743956571709506'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2011/07/regmt-system-for-machine-translation.html' title='RegMT System for Machine Translation, System Combination, and Evaluation'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-8384278778229705490</id><published>2011-07-30T22:58:00.021+03:00</published><updated>2011-09-18T12:57:50.431+03:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Publications'/><title type='text'>Instance Selection for Machine Translation using Feature Decay Algorithms</title><content type='html'>Ergun Bicici; Deniz Yuret.  &lt;i&gt;Proceedings of the Sixth Workshop on Statistical Machine Translation.&lt;/i&gt;  pp. 272-283.  Edinburgh, Scotland.  July, 2011.  (&lt;a href="http://aclweb.org/anthology-new/W/W11/W11-2131.pdf"&gt;PDF&lt;/a&gt;, &lt;a href="http://aclweb.org/anthology-new/W/W11/W11-2131.bib"&gt;BIB&lt;/a&gt;, &lt;a href="http://aclweb.org/anthology-new/W/W11/#2100"&gt;Proceedings&lt;/a&gt;, &lt;a href="https://docs.google.com/viewer?a=v&amp;pid=explorer&amp;chrome=true&amp;srcid=0B6C4-zOYlkxsNDc3YWY5NWMtYzUwYS00OWZhLWE1MTAtOGZlMDUxYWI0MGU5&amp;hl=en"&gt;Presentation&lt;/a&gt;)&lt;br /&gt;&lt;span class="fullpost"&gt;&lt;br /&gt;&lt;b&gt;Abstract:&lt;/b&gt; We present an empirical study of instance selection techniques for machine translation. In an active learning setting, instance selection minimizes the human effort by identifying the most informative sentences for translation. In a transductive learning setting, selection of training instances relevant to the test set improves the final translation quality. After reviewing the state of the art in the field, we generalize the main ideas in a class of instance selection algorithms that use feature decay. Feature decay algorithms increase diversity of the training set by devaluing features that are already included. We show that the feature decay rate has a very strong effect on the final translation quality whereas the initial feature values, inclusion of higher order features, or sentence length normalizations do not. We evaluate the best instance selection methods using a standard Moses baseline using the whole 1.6 million sentence English-German section of the Europarl corpus. We show that selecting the best 3000 training sentences for a specific test sentence is sufficient to obtain a score within 1 BLEU of the baseline, using 5% of the training data is sufficient to exceed the baseline, and a ~ 2 BLEU improvement over the baseline is possible by optimally selected subset of the training data. In out-of-domain translation, we are able to reduce the training set size to about 7% and achieve a similar performance with the baseline.&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-8384278778229705490?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='related' href='http://aclweb.org/anthology-new/W/W11/W11-2131.pdf' title='Instance Selection for Machine Translation using Feature Decay Algorithms'/><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/8384278778229705490/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=8384278778229705490' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/8384278778229705490'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/8384278778229705490'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2011/07/instance-selection-for-machine.html' title='Instance Selection for Machine Translation using Feature Decay Algorithms'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-8323428411896608302</id><published>2011-06-21T22:04:00.002+03:00</published><updated>2011-06-22T02:30:06.711+03:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Notes'/><title type='text'>ACL 2011 Tutorials</title><content type='html'>&lt;div&gt;Here are some notes from the ACL tutorials:&lt;/div&gt;&lt;span class="fullpost"&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;span class="Apple-style-span" style="color: rgb(0, 48, 16); font-family: tahoma; font-size: 13px; "&gt;&lt;a href="http://www.acl2011.org/tutorials_05heinz.shtml" style="color: rgb(0, 0, 119); "&gt;Formal and Empirical Grammatical Inference&lt;/a&gt;&lt;br /&gt; Jeffrey Heinz, Colin de la Higuera and Menno van Zaanen&lt;/span&gt;&lt;div&gt;&lt;span class="Apple-style-span" style="color: rgb(0, 48, 16); font-family: tahoma; font-size: 13px; "&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span"  &gt;Jeffrey and Colin motivated and presented the main results for formal grammatical inference.  Even though many theoretical results are negative, learning is usually possible by restricting the model class (to a well defined subset used by natural languages) or assuming a non-distribution-free setting.  &lt;a href="http://www.amazon.com/Grammatical-Inference-Learning-Automata-Grammars/dp/0521763169"&gt;Colin's book&lt;/a&gt; was recommended as a good introduction to the theory.  It would be interesting to see if Turkish morphotactics or morphophonemics fall into one of the easy-to-learn model subclasses.  You can &lt;a href="http://phonology.cogsci.udel.edu/~heinz/talks/Heinz-HigueraEtAl-2011-FEGI-slides.pdf"&gt;download the slides&lt;/a&gt;.&lt;/span&gt;&lt;/div&gt;  &lt;div&gt;&lt;span class="Apple-style-span"  &gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span"  &gt;&lt;span class="Apple-style-span" style="font-size: 13px; "&gt;&lt;a href="http://www.acl2011.org/tutorials_11druck.shtml" style="color: rgb(0, 0, 119); "&gt;Rich Prior Knowledge in Learning for Natural Language Processing&lt;/a&gt;&lt;br /&gt; Gregory Druck, Kuzman Ganchev, Joao Graca&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span"  &gt;&lt;span class="Apple-style-span" style="font-size: 13px; "&gt;&lt;br /&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span"  &gt;&lt;span class="Apple-style-span" style="font-size: 13px; "&gt;There is a recent trend for encoding prior knowledge in learning problems not in the prior distributions but later in the learning process.  The prior knowledge usually comes in the form of feature-class expectations and guiding the model toward the correct expectations is only possible after considering the input.  Posterior regularization, constraint driven learning, and generalized expectation criteria seem to be related implementations of this idea.  You can &lt;a href="http://sideinfo.wikkii.com/wiki/Main_Page"&gt;download the slides&lt;/a&gt;.&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;  &lt;div&gt;&lt;span class="Apple-style-span"  &gt;&lt;span class="Apple-style-span" style="font-size: 13px; "&gt;&lt;br /&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span"  &gt;&lt;span class="Apple-style-span" style="font-size: 13px; "&gt;&lt;span class="Apple-style-span" style="font-size: 13px; "&gt;&lt;a href="http://www.acl2011.org/tutorials_18collins.shtml" style="color: rgb(0, 0, 119); "&gt;Dual Decomposition for Natural Language Processing&lt;/a&gt;&lt;br /&gt; Michael Collins and Alexander M Rush&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span"  &gt;&lt;span class="Apple-style-span" style="font-size: 13px; "&gt;&lt;span class="Apple-style-span" style="font-size: 13px; "&gt;&lt;br /&gt; &lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div&gt;&lt;span class="Apple-style-span"  &gt;&lt;span class="Apple-style-span" style="font-size: 13px; "&gt;&lt;span class="Apple-style-span" style="font-size: 13px; "&gt;I did not attend this one but &lt;a href="http://people.csail.mit.edu/srush/dual_decomp_tutorial.pdf"&gt;here are the slides&lt;/a&gt;.&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;  &lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-8323428411896608302?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/8323428411896608302/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=8323428411896608302' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/8323428411896608302'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/8323428411896608302'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2011/06/acl-2011-tutorials.html' title='ACL 2011 Tutorials'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-1606608782970598819</id><published>2011-05-23T00:26:00.006+03:00</published><updated>2011-05-23T01:28:32.406+03:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Notes'/><category scheme='http://www.blogger.com/atom/ns#' term='Books'/><title type='text'>Semantic Structures by Ray Jackendoff</title><content type='html'>Jackendoff divides the study of the language faculty into three components: phonological, syntactic, and conceptual.  In speech recognition a model that tells us what word sequences are likely is essential to recognition accuracy, because the acoustic information is often ambiguous (e.g. "She kissed this guy." vs. "She kissed the sky.").  Stated in Jackendoff's terms, a probabilistic model of the syntactic component is helpful in disambiguating what is going on in the phonological component.  Similarly I think a model of what is likely in the conceptual component is essential to resolving ambiguities in the syntactic component.  If "what is likely to be said" is essential in interpreting "what is heard", then "what is likely to be meant" is similarly essential in interpreting "what is said".  &lt;br /&gt;&lt;br /&gt;Unfortunately we do not have good conceptual models yet, so computational linguists still try to make do with error prone hand tagging and shallow machine learning to disambiguate senses, references, and relations.&lt;br /&gt;&lt;span class="fullpost"&gt;&lt;br /&gt;On a side note, each component in Jackendoff's work is modeled after the generative paradigm which, for the syntactic component, is described as follows:&lt;ol&gt;&lt;li&gt;Speakers can understand and create an indefinite number of sentences they have never heard before.&lt;br /&gt;&lt;li&gt;Therefore the repertoire of syntactic structures cannot be characterized as a finite list of sentences.&lt;br /&gt;&lt;li&gt;Nor can it be characterized as an infinite list of possible sentences because we have finite brains.&lt;br /&gt;&lt;li&gt;Thus it MUST be mentally encoded in terms of a finite set of primitives and a finite set of principles of combination that collectively generate the class of possible sentences.&lt;/ol&gt;&lt;br /&gt;Am I the only one befuddled by this argument?  Primitives plus means of combination is certainly one way to create infinity using finite means, but why assume it is the only way?  Dynamic systems, random processes, who knows what else can lead to infinite possible outcomes from a finite initial endowment.  Why just present two strawmen, finite and infinite lists, as the only alternatives to discrete primitives and combination?  Why after a couple of paragraphs further narrow the description to "the argument from creativity to the NECESSITY for principles or rules in syntactic knowledge"?  Discrete primitives with finite and definite constraints and rules of combination is one way to build a representational system, unlikely to be the correct way for all three components of language, and certainly not the only way.&lt;br /&gt;&lt;br /&gt;See also: &lt;a href="http://denizyuret.blogspot.com/2011/01/plausibility-vs-inference.html"&gt;Plausibility vs. Inference&lt;/a&gt;.&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-1606608782970598819?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='related' href='http://www.amazon.com/Semantic-Structures-Current-Studies-Linguistics/dp/026260020X' title='Semantic Structures by Ray Jackendoff'/><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/1606608782970598819/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=1606608782970598819' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/1606608782970598819'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/1606608782970598819'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2011/05/semantic-structures-by-ray-jackendoff.html' title='Semantic Structures by Ray Jackendoff'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-8586167824531731456</id><published>2011-05-11T10:44:00.004+03:00</published><updated>2011-05-23T01:45:14.348+03:00</updated><title type='text'>The Noisy Channel Model for Unsupervised Word Sense Disambiguation</title><content type='html'>&lt;iframe src="https://docs.google.com/present/embed?id=d2jm3f3_18964v3fzcf2" frameborder="0" width="410" height="342"&gt;&lt;/iframe&gt;&lt;br /&gt;Seminar presentation at ITU on our &lt;a href="http://denizyuret.blogspot.com/2009/09/noisy-channel-model-for-unsupervised.html"&gt;CL paper&lt;/a&gt;.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-8586167824531731456?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/8586167824531731456/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=8586167824531731456' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/8586167824531731456'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/8586167824531731456'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2011/05/noisy-channel-model-for-unsupervised.html' title='The Noisy Channel Model for Unsupervised Word Sense Disambiguation'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-3225872467542815232</id><published>2011-04-11T16:15:00.000+03:00</published><updated>2011-04-11T16:16:08.988+03:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Notes'/><title type='text'>A tutorial on language modeling with SRILM</title><content type='html'>&lt;iframe src="https://docs.google.com/present/embed?id=d2jm3f3_1875gpk8r2d3" frameborder="0" width="410" height="342"&gt;&lt;/iframe&gt;&lt;span class="fullpost"&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-3225872467542815232?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/3225872467542815232/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=3225872467542815232' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/3225872467542815232'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/3225872467542815232'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2011/04/tutorial-on-language-modeling-with.html' title='A tutorial on language modeling with SRILM'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-7398584317925364010</id><published>2011-03-24T16:43:00.018+02:00</published><updated>2011-04-06T20:55:27.064+03:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Books'/><title type='text'>From Eternity to Here by Sean Carroll</title><content type='html'>Why is the future different than the past?  The fundamental rules of physics seem as symmetric in time as they are for different directions of space, yet you can't unscramble an egg and you can't remember the future.  The answer seems to lie in the concept of entropy, which tends to increase over time in a closed system unlike energy and momentum which stay constant.  Yet, entropy is one of these concepts that lead to quite a bit of confusion, sort of like (and sometimes for the same reasons as) &lt;a href="http://www.denizyuret.com/2005/06/probability-twisters.html"&gt;probability&lt;/a&gt;.  Sean Carrol's wonderful book tackles the mysteries of entropy and arrow of time.  Here are some entropy puzzles and some further reading:&lt;br /&gt;&lt;span class="fullpost"&gt;&lt;br /&gt;&lt;b&gt;Entropy puzzles:&lt;/b&gt;&lt;br /&gt;&lt;ul&gt;&lt;li&gt;If you find yourself in a universe with low entropy it is as likely to have come from a higher entropy state as it is to evolve into a higher entropy state.  So how do you know you are not a &lt;a href="http://en.wikipedia.org/wiki/Boltzmann_brain"&gt;Boltzmann brain&lt;/a&gt;?&lt;br /&gt;&lt;/li&gt;&lt;li&gt;Take a box with a partition in it, with gas A on one side, gas B on the other side, and both gases are at the same temperature and pressure.  Remove the partition. If gas A and B are different gases, there is an entropy that arises due to the mixing. If the gases are the same, no additional entropy is calculated.  What if you thought they were the same gas and years later it was discovered that they happened to be two different isotopes?  (See &lt;a href="http://en.wikipedia.org/wiki/Gibbs_paradox"&gt;Gibbs paradox&lt;/a&gt; and E.T. Jaynes' &lt;a href="http://bayes.wustl.edu/etj/articles/gibbs.paradox.pdf"&gt;paper&lt;/a&gt;).  More generally this microstate / macrostate business seems completely user defined and arbitrary, so how can it have real physical effects?&lt;br /&gt;&lt;/li&gt;&lt;li&gt;In a reversible system there must be just as many paths that decrease the entropy as that increase the entropy.  Why don't we observe as many of the first type as the second?&lt;br /&gt;&lt;/li&gt;&lt;li&gt;A gas squeezed in the corner of a room will tend to spread thereby increase its disorder and entropy.  If we add an attractive force like gravity matter seems to clump together rather than spread out.  How does clumping together increase entropy?&lt;br /&gt;&lt;/li&gt;&lt;li&gt;A rotting plant turns into dust and gas which increases disorder and entropy.  A seed turns a bunch of gas and dust into a full grown tree which seems to decrease entropy.  This can only happen because the seed is not a closed system and is using the energy from the sun and ends up increasing the overall entropy of the universe at the end.  When, how, and why does this type of thing happen?&lt;br /&gt;&lt;/li&gt;&lt;/ul&gt;&lt;br /&gt;&lt;b&gt;Further reading:&lt;/b&gt;&lt;br /&gt;&lt;ul&gt;&lt;li&gt;&lt;a href="http://www.amazon.com/Eternity-Here-Quest-Ultimate-Theory/dp/0452296544"&gt;From Eternity to Here&lt;/a&gt;: Sean Carroll's book that inspired this post.&lt;br /&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://www.amazon.com/Labyrinth-Time-Introducing-Universe/dp/0199249954"&gt;Labyrinth of Time&lt;/a&gt;: Michael Lockwood's book on the arrow of time with a bit more philosophy and a bit less black hole physics.&lt;br /&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://www.amazon.com/Time-Chance-David-Z-Albert/dp/0674011325"&gt;Time and Chance&lt;/a&gt;: David Z. Albert's book gives one of the clearest discussions of thermodynamics (both classical and statistical) and its relation to the arrow of time.&lt;br /&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://www.amazon.com/Good-Real-Demystifying-Paradoxes-Bradford/dp/0262042339"&gt;Good and Real&lt;/a&gt;: Gary L. Drescher's book that tackles not only the arrow of time, but  quantum indeterminacy, consciousness, free will, and ethics.  This is one of my favorite books which inspired a &lt;a href="http://www.denizyuret.com/2006/11/termodinamigin-ikinci-kanunu-uzerine.html"&gt;few&lt;/a&gt; &lt;a href="http://www.denizyuret.com/2006/10/on-ethics.html"&gt;earlier&lt;/a&gt; &lt;a href="http://www.denizyuret.com/2006/10/prediction-determinism-and-free-will.html"&gt;posts&lt;/a&gt;.&lt;br /&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://bayes.wustl.edu"&gt;E. T. Jaynes&lt;/a&gt; has a &lt;a href="http://bayes.wustl.edu/etj/articles/prob.as.logic.pdf"&gt;number&lt;/a&gt; &lt;a href="http://bayes.wustl.edu/etj/articles/prob.in.qm.pdf"&gt;of&lt;/a&gt; &lt;a href="http://bayes.wustl.edu/etj/articles/cmystery.pdf"&gt;articles&lt;/a&gt; &lt;a href="http://bayes.wustl.edu/etj/articles/ccarnot.pdf"&gt;that&lt;/a&gt; &lt;a href="http://bayes.wustl.edu/etj/articles/cgibbs.pdf"&gt;clarify&lt;/a&gt; some of the mysteries.  See also these &lt;a href="http://www.mdpi.org/lin/entropy"&gt;links&lt;/a&gt;.&lt;br /&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://www.amazon.com/Permutation-City-Greg-Egan/dp/006105481X"&gt;Permutation City&lt;/a&gt;: One of my &lt;a href="http://www.denizyuret.com/2007/02/permutation-city-greg-egan.html"&gt;favorite&lt;/a&gt; sci-fi novels by &lt;a href="http://www.gregegan.net"&gt;Greg Egan&lt;/a&gt; explores the nature of time, simulation and reality.  Maybe instants are not ordered in time at all, the state of our short term memory seems to give this impression.&lt;br /&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://space.mit.edu/home/tegmark/home.html"&gt;Max Tegmark&lt;/a&gt; at MIT came up with the &lt;a href="http://en.wikipedia.org/wiki/Mathematical_universe_hypothesis"&gt;Mathematical universe hypothesis&lt;/a&gt; which is reminiscent of the &lt;a href="http://www.gregegan.net/PERMUTATION/FAQ/FAQ.html"&gt;Dust theory&lt;/a&gt; that underlies Permutation City.  See also Hans Moravec's &lt;a href="http://www.frc.ri.cmu.edu/~hpm/project.archive/general.articles/1998/SimConEx.98.html"&gt;essay&lt;/a&gt;.&lt;br /&gt;&lt;/li&gt;&lt;/ul&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-7398584317925364010?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='related' href='http://www.amazon.com/Eternity-Here-Quest-Ultimate-Theory/dp/0525951334' title='From Eternity to Here by Sean Carroll'/><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/7398584317925364010/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=7398584317925364010' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/7398584317925364010'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/7398584317925364010'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2011/03/from-eternity-to-here-by-sean-carroll.html' title='From Eternity to Here by Sean Carroll'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-4577038831216231591</id><published>2011-03-17T13:55:00.000+02:00</published><updated>2011-03-17T13:56:46.412+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Projects'/><title type='text'>Bologna Translation Service</title><content type='html'>&lt;iframe src="https://docs.google.com/present/embed?id=d2jm3f3_1861pszqj3cq" frameborder="0" width="410" height="342"&gt;&lt;/iframe&gt;&lt;br /&gt;&lt;span class="fullpost"&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-4577038831216231591?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/4577038831216231591/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=4577038831216231591' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/4577038831216231591'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/4577038831216231591'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2011/03/bologna-translation-service.html' title='Bologna Translation Service'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-7757861527507049955</id><published>2011-01-03T09:39:00.004+02:00</published><updated>2011-01-03T10:59:59.618+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Notes'/><title type='text'>Plausibility vs. Inference</title><content type='html'>Here are two examples of common sense judgement:&lt;br /&gt;&lt;br /&gt;(1) I saw the Statue of Liberty flying over New York. =&gt; I was flying over New York.&lt;br /&gt;(2) I gave the book to Mary.  =&gt; Mary has the book.&lt;span class="fullpost"&gt;&lt;br /&gt;&lt;br /&gt;The first one involves syntactic disambiguation.  Either the subject or the object could be doing the flying (consider "I saw the airplane flying over New York.")  Our "common sense" tells us that the Statue is too big to fly, so I am the more likely one flying.&lt;br /&gt;&lt;br /&gt;The second one involves a semantic inference.  The meaning of give involves a physical transfer or a transfer of possession, as a consequence the item given ends up with the recipient.  Our "common sense" is full of such little factoids (here is another: "Oswald killed Kennedy." =&gt; "Kennedy is dead.") which let us see beyond what is explicitly stated in the text.&lt;br /&gt;&lt;br /&gt;I want to emphasize that these examples are qualitatively different and calling them both "common sense judgements" may be confusing.  The first one is a plausibility judgement (which is more likely to fly: me or the statue?).  The second one is an exact inference, i.e. "give" definitely causes transfer and "kill" definitely causes death.  To solve the first one we need a model of what is more likely to be happening in the world.  To solve the second one we need more traditional inference of what entails what.&lt;br /&gt;&lt;br /&gt;Disambiguation problems in computational linguistics (word sense disambiguation, resolving syntactic ambiguities, etc.) rely on plausibility judgements, not exact inference.  A lot of work in AI "common sense reasoning" will not help there because traditionally reasoning and inference work focus on exact judgements.&lt;br /&gt;&lt;br /&gt;As far as I can see nobody is working on plausibility judgements explicitly.  Researchers use corpus statistics as a proxy to solve disambiguation problems.  This may be obscuring the real issue: I think the right way to do linguistic disambiguation is to have a model of what is plausible in the world.&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-7757861527507049955?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/7757861527507049955/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=7757861527507049955' title='3 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/7757861527507049955'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/7757861527507049955'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2011/01/plausibility-vs-inference.html' title='Plausibility vs. Inference'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>3</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-6872359910845064788</id><published>2010-12-23T16:59:00.007+02:00</published><updated>2010-12-23T17:28:27.918+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Notes'/><title type='text'>Research focus</title><content type='html'>In 1976 John McCarthy, one of the founders of artificial intelligence, wrote a memo discussing the problem of getting a computer to understand a story from the New York Times: &lt;span class="fullpost"&gt;&lt;blockquote&gt;&lt;i&gt;"A 61-year old furniture salesman, John J. Hug, was pushed down the shaft of a freight elevator yesterday in his downtown Brooklyn store by two robbers while a third attempted to crush him with the elevator car because they were dissatisfied with the $1,200 they had forced him to give them." &lt;/i&gt;&lt;/blockquote&gt; McCarthy suggested that a real understanding of this story would entail being able to answer questions like: &lt;blockquote&gt;&lt;i&gt; Who was in the store when the events began?&lt;br /&gt; Who had the money at the end?&lt;br /&gt; Did Mr. Hug know he was going to be robbed?&lt;br /&gt; Does he know now that he was robbed? &lt;/i&gt;&lt;/blockquote&gt;Answering these questions is still beyond the state of the art in natural language processing, because they require common sense knowledge in addition to the text of the story.  In fact the problems associated with answering questions only based on the text of the story are only beginning to be solved on a large scale: &lt;blockquote&gt;&lt;i&gt; When and where was Mr. Hug pushed?&lt;br /&gt; Who forced who to give $1,200 to whom?&lt;br /&gt; Did the money satisfy the robbers? &lt;/i&gt;&lt;/blockquote&gt; To achieve an understanding at this level, we need to address linguistic problems like word sense disambiguation ("push" has 15 senses), named entity recognition (Mr. Hug = John J. Hug), anaphora resolution (him = John J. Hug), parsing (who did what to whom?), semantic relation identification (dissatisfied with $1,200 = $1,200 did not satisfy).  The figure below illustrates the main challenge: the ambiguity present in most natural language expressions.  Our group studies statistical machine learning methods to address these problems with the eventual goal of natural language understanding by machines.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://1.bp.blogspot.com/_306eXDjZh7g/TRNnBauesAI/AAAAAAAAANc/NdvTlXbRsvM/s1600/dyuret-figure.jpg"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 320px; height: 314px;" src="http://1.bp.blogspot.com/_306eXDjZh7g/TRNnBauesAI/AAAAAAAAANc/NdvTlXbRsvM/s320/dyuret-figure.jpg" border="0" alt=""id="BLOGGER_PHOTO_ID_5553896039529754626" /&gt;&lt;/a&gt;&lt;blockquote&gt;&lt;i&gt;Different interpretations of the sentence: "I saw the man on the hill with a telescope."&lt;/i&gt;&lt;/blockquote&gt;&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-6872359910845064788?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/6872359910845064788/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=6872359910845064788' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/6872359910845064788'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/6872359910845064788'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2010/12/research-focus.html' title='Research focus'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://1.bp.blogspot.com/_306eXDjZh7g/TRNnBauesAI/AAAAAAAAANc/NdvTlXbRsvM/s72-c/dyuret-figure.jpg' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-1816354089489137808</id><published>2010-11-16T23:16:00.028+02:00</published><updated>2010-11-17T07:53:16.091+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Notes'/><title type='text'>Naive Bayes is a joint Maximum Entropy Model</title><content type='html'>Naive Bayes and Maximum Entropy models are both popular in NLP applications.  In this post I will show that Naive Bayes is really a type of joint Maximum Entropy model (and much easier to compute).  Maximum Entropy models aim to find the distribution that maximizes entropy while still obeying certain feature expectation constraints.  Maximizing entropy with feature expectation constraints turns out to be equivalent to maximizing likelihood in a log-linear (aka exponential) model.   Logistic regression is a specific type of conditional maxent model.  Naive Bayes is a specific type of joint maxent model. &lt;span class="fullpost"&gt;&lt;br /&gt;&lt;br /&gt;Consider a classification problem with a D dimensional input vector x and a discrete output variable y.  Naive Bayes models assume that the individual components of x are independent given the output y:&lt;br /&gt;&lt;br /&gt;\[ p(\mathbf{x}, y) = p(y) \prod_{d=1}^D p(x_d|y) \]&lt;br /&gt;&lt;br /&gt;The factors on the right hand side are then estimated using maximum likelihood, which boils down to counting frequencies if x and y are both discrete.  In comparison here is a joint maximum entropy model:&lt;br /&gt;&lt;br /&gt;\[ p(\mathbf{x}, y) = \frac{1}{Z} \exp(\sum_{m=1}^M \lambda_m f_m(\mathbf{x}, y)) \]&lt;br /&gt;&lt;br /&gt;where fm are arbitrary real valued "feature functions" of the input and the output and Z is a normalization constant.  This looks nothing like Naive Bayes at first glance, but if we choose feature functions that look at the output y with at most one component of the input x:&lt;br /&gt;&lt;br /&gt;\[ p(\mathbf{x}, y) = \frac{1}{Z} \exp(\lambda_0 f_0(y)) \prod_{d=1}^D \exp(\lambda_d f_d(x_d, y)) \]&lt;br /&gt;&lt;br /&gt;the similarity becomes more apparent.  In fact if the feature functions are binary, this expression will have exactly the same form as the Naive Bayes expression, and maximizing likelihood will give exactly the same answers.  Let us illustrate with an example:&lt;br /&gt;&lt;br /&gt;\[ \begin{array}{cccc}&lt;br /&gt;x_1 &amp; x_2 &amp; y &amp; n \\&lt;br /&gt;1 &amp; 1 &amp; 1 &amp; 3 \\&lt;br /&gt;1 &amp; 1 &amp; 0 &amp; 1 \\&lt;br /&gt;0 &amp; 0 &amp; 1 &amp; 1 \\&lt;br /&gt;0 &amp; 0 &amp; 0 &amp; 3 \\&lt;br /&gt;\end{array} \]&lt;br /&gt;&lt;br /&gt;Here x1 and x2 are inputs, y is the output, and n is the number of times a particular combination has been observed.  The Naive Bayes maximum likelihood estimates will give:&lt;br /&gt;&lt;br /&gt;\[ \begin{array}{l}&lt;br /&gt;p(y=1) = p(y=0) = 1/2 \\&lt;br /&gt;p(x_1=1|y=1) = p(x_2=1|y=1) = 3/4 \\&lt;br /&gt;p(x_1=1|y=0) = p(x_2=1|y=0) = 1/4 \\&lt;br /&gt;\end{array} \]&lt;br /&gt;&lt;br /&gt;which will lead to the following not-so-good estimates:&lt;br /&gt;&lt;br /&gt;\[ \begin{array}{lll}&lt;br /&gt;x_1 &amp; x_2 &amp; p(y=1|x_1, x_2) \\&lt;br /&gt;1 &amp; 1 &amp; 0.9 \\&lt;br /&gt;1 &amp; 0 &amp; 0.5 \\&lt;br /&gt;0 &amp; 1 &amp; 0.5 \\&lt;br /&gt;0 &amp; 0 &amp; 0.1 \\&lt;br /&gt;\end{array} \]&lt;br /&gt;&lt;br /&gt;The equivalent joint maximum entropy model will have 10 feature functions.  The following table specifies when each feature function will return 1, star indicating "don't care":&lt;br /&gt;&lt;br /&gt;\[ \begin{array}{lllll}&lt;br /&gt;f &amp; y &amp; x_1 &amp; x_2 &amp; \lambda \\&lt;br /&gt;f_0 &amp; 0 &amp; * &amp; * &amp; \lambda_0 \\&lt;br /&gt;f_1 &amp; 1 &amp; * &amp; * &amp; \lambda_0 \\&lt;br /&gt;f_2 &amp; 0 &amp; 0 &amp; * &amp; \lambda_1 \\&lt;br /&gt;f_3 &amp; 0 &amp; 1 &amp; * &amp; \lambda_2 \\&lt;br /&gt;f_4 &amp; 1 &amp; 0 &amp; * &amp; \lambda_2 \\&lt;br /&gt;f_5 &amp; 1 &amp; 1 &amp; * &amp; \lambda_1 \\&lt;br /&gt;f_6 &amp; 0 &amp; * &amp; 0 &amp; \lambda_1 \\&lt;br /&gt;f_7 &amp; 0 &amp; * &amp; 1 &amp; \lambda_2 \\&lt;br /&gt;f_8 &amp; 1 &amp; * &amp; 0 &amp; \lambda_2 \\&lt;br /&gt;f_9 &amp; 1 &amp; * &amp; 1 &amp; \lambda_1 \\&lt;br /&gt;\end{array} \]&lt;br /&gt;&lt;br /&gt;Because of the symmetry of this example, not all lambda coefficients will be distinct, in fact only three distinct lambdas are necessary as indicated in the last column of the table.  The maximum likelihood estimates of the lambdas will satisfy the expectation constraints: the empirical frequency of each feature will be equal to the expected frequency:&lt;br /&gt;&lt;br /&gt;\[ \frac{1}{N} \sum_{n=1}^N f(\mathbf{x}_n, y_n) = \sum_{\mathbf{x},y} p(\mathbf{x}, y) f(\mathbf{x}, y) \]&lt;br /&gt;&lt;br /&gt;Here the sum on the left is over the instances in the dataset, whereas the sum on the right is over all possible x, y pairs.  This equation can be derived by setting the derivative of the log likelihood expression to zero.  The following parameters satisfy these constraints and thus maximize the likelihood:&lt;br /&gt;&lt;br /&gt;\[ \lambda_0 = \lambda_2 = 0, \lambda_1 = \log(3), Z = 32 \]&lt;br /&gt;&lt;br /&gt;This joint maximum entropy model gives exactly the same results as the Naive Bayes model.  For example:&lt;br /&gt;&lt;br /&gt;\[ p(x_1=1, x_2=1, y=1) = \frac{1}{Z} \exp(\lambda_0 + 2 \lambda_1) = \frac{9}{32} \]&lt;br /&gt;\[ p(x_1=1, x_2=1, y=0) = \frac{1}{Z} \exp(\lambda_0 + 2 \lambda_2) = \frac{1}{32} \]&lt;br /&gt;&lt;br /&gt;This example is from Chris Manning's NLP class (&lt;a href="http://see.stanford.edu/see/lecturelist.aspx?coll=63480b48-8819-4efd-8412-263f1a472f5a"&gt;lecture 7&lt;/a&gt;) where he compares the Naive Bayes model to a conditional Maximum Entropy model:&lt;br /&gt;&lt;br /&gt;\[ p(y | \mathbf{x}) = \frac{1}{Z(\mathbf{x})} \exp(\sum_{m=1}^M \lambda_m f_m(\mathbf{x}, y)) \]&lt;br /&gt;&lt;br /&gt;A conditional maxent model will satisfy conditional expectation constraints and maximize conditional likelihood (which can again be derived by setting the derivative of the log conditional likelihood to zero):&lt;br /&gt;&lt;br /&gt;\[ \sum_{n=1}^N f(\mathbf{x}_n, y_n) = \sum_{n=1}^N \sum_y p(y|\mathbf{x}_n) f(\mathbf{x}_n, y) \]&lt;br /&gt;&lt;br /&gt;Using the same set of feature functions and parameters, one can derive the values that satisfy the constraints:&lt;br /&gt;&lt;br /&gt;\[ \lambda_0 = \lambda_2 = 0, \lambda_1 = \frac{1}{2} \log(3) \]&lt;br /&gt;&lt;br /&gt;The conditional model will give probabilities that match the data perfectly:&lt;br /&gt;&lt;br /&gt;\[ \begin{array}{lll}&lt;br /&gt;x_1 &amp; x_2 &amp; p(y=1|x_1, x_2) \\&lt;br /&gt;1 &amp; 1 &amp; 3/4 \\&lt;br /&gt;1 &amp; 0 &amp; 1/2 \\&lt;br /&gt;0 &amp; 1 &amp; 1/2 \\&lt;br /&gt;0 &amp; 0 &amp; 1/4 \\&lt;br /&gt;\end{array} \]&lt;br /&gt;&lt;br /&gt;However it is important to understand that this model handles feature interactions better than Naive Bayes not because of any magic associated with maxent models, but as a result of maximizing conditional instead of joint likelihood.  When you maximize joint likelihood, maxent gives the same results as Naive Bayes.  Of course with maxent, one always has the option of defining features with cross terms (more than one input variable) to handle feature interaction.&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-1816354089489137808?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/1816354089489137808/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=1816354089489137808' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/1816354089489137808'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/1816354089489137808'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2010/11/naive-bayes-is-joint-maximum-entropy.html' title='Naive Bayes is a joint Maximum Entropy Model'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-3040782466397353132</id><published>2010-11-15T18:00:00.003+02:00</published><updated>2010-11-15T18:06:54.507+02:00</updated><title type='text'>Next Generation Parser Evaluation</title><content type='html'>An ACL Workshop Proposal by &lt;a href="http://www.cl.cam.ac.uk/~lr346/"&gt;Laura Rimell&lt;/a&gt; and &lt;a href="http://www.denizyuret.com"&gt;Deniz Yuret&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;This workshop aims to foster the development of innovative, targeted, formalism-independent parser evaluation resources and methods that will guide us in building the next generation of parsers.  &lt;span class="fullpost"&gt;&lt;br /&gt;&lt;br /&gt;Under many of our existing evaluation measures, parsing accuracy appears to have plateaued around the 90% mark. To continue making meaningful improvements to parsing technology, we first need to clarify what this 90% represents. Do our evaluations measure semantically-relevant syntactic phenomena? Do they accurately represent multiple domains, languages, and formalisms? How relevant are they for downstream tasks? Do they reflect the level of inter-annotator agreement? We also need to identify and understand the "missing 10%": there is a growing awareness in the community that parsers may perform poorly on less frequent but semantically important syntactic phenomena, but in fact we are not even certain whether such crucial phenomena are represented in our current evaluation schemes. We need new ways of highlighting the specific areas where parsers need to improve.&lt;br /&gt;&lt;br /&gt;We believe parser evaluation should:&lt;br /&gt;&lt;br /&gt;- be relevant for multiple formalisms, languages, and domains&lt;br /&gt;- be targeted towards finding parser weaknesses&lt;br /&gt;- focus on semantically important tasks&lt;br /&gt;- be extrinsic or task-oriented as well as intrinsic&lt;br /&gt;- be based on schemes with high inter-annotator agreement&lt;br /&gt;- show us how we can improve parser training methods&lt;br /&gt;&lt;br /&gt;The workshop builds on the insights gained from the COLING-08 workshop on Cross-Framework and Cross-Domain Parser Evaluation. This earlier workshop made particular inroads towards framework-independent parser evaluation by fostering discussion of formalism-independent schemes, especially grammatical relation schemes.&lt;br /&gt;&lt;br /&gt;Despite the advances made in cross-framework evaluation, such evaluations still suffer from a loss of accuracy arising from conversion between output formats. One recent answer to this problem is the PETE task.  In PETE (Yuret et al., 2010, http://pete.yuret.com) parser evaluation is performed using simple syntactic entailment questions.  Given the sentence "The man who stole my car went to jail", the annotator is asked to judge entailments like "The man went to jail" or "My car went to jail".  This scheme is formalism-independent, has high inter-annotator agreement, and focuses evaluation on semantically relevant distinctions. A new version of PETE will form the shared task for this workshop.&lt;br /&gt;&lt;br /&gt;Another known weakness in existing evaluation measures, including ones based on grammatical relation formalisms, is that they are aggregate measures, in which syntactic phenomena are de facto weighted by frequency rather than by degree of syntactic difficulty or semantic importance. Thus such measures are are likely to have disproportionate contributions from high-frequency, "easy" grammatical phenomena such as determiners and subjects; while frequency weighting is obviously important, it makes it difficult to discern the phenomena where parsers really need to improve.&lt;br /&gt;&lt;br /&gt;One answer to this problem is to focus evaluation on syntactic phenomena which we know to be difficult for parsers, such as the unbounded dependency evaluations performed in Rimell et al. (2009) and Nivre et al. (2010). This area is wide open for development: we have known for a long time that parsers have difficulty with phenomena like coordination and PP attachment, but are there other problematic constructions?  We should also focus on finding new ways of determining which phenomena are most difficult, and hence where we need to focus parser training efforts. Also crucial is finding ways to measure the importance of parser errors for downstream tasks, especially semantic tasks, and weighting parser performance accordingly.&lt;br /&gt;&lt;br /&gt;Third, many evaluations are still intrinsic, and while intrinsic evaluations play an important role -- especially for developing new parsers, and for fine-grained comparisons with previous work -- it is increasingly clear that performance on intrinsic evaluations doesn't always predict task performance.&lt;br /&gt;&lt;br /&gt;Recent papers such as Miyao et al. (2008, 2009) and Miwa et al. (2010) focus on task-based evaluation, especially for the biomedical domain.  We need more evaluations that focus on a greater range of tasks, languages, and domains (or even subdomains, since the field has barely begun to address how the vocabulary and writing conventions across e.g. biomedical subdomains may affect parsing accuracy).&lt;br /&gt;&lt;br /&gt;Finally, unlike other NLP subfields, almost no parser evaluation studies discuss the relevance of inter-annotator agreement. It may be that the 90% evaluation plateau reflects the limits of inter-annotator agreement, but we lack a clear picture of how these figures correspond. New, more natural annotation methods may help in this area.&lt;br /&gt;&lt;br /&gt;At this workshop we especially encourage papers that consider how techniques and resources from other NLP subfields can be brought to bear on parser evaluation.  Perhaps resources annotated with information on compound nouns, subcategorization frames, selectional preferences, or textual entailments may serve as gold standards. Perhaps new gold standards may be created by exploiting shallow parsing or novel approaches to human annotation. Perhaps we can learn something from sentence simplification, semantic parsing, or active learning. Ultimately we are interested in finding new and exciting ways to identify where we need to improve our parsers.&lt;br /&gt;&lt;br /&gt;The workshop will have two parts.&lt;br /&gt;&lt;br /&gt;Part I: PETE-2 shared task. This will be an updated version of the successful SemEval-2010 shared task on Parser Evaluation using Textual Entailments. As noted in Yuret et al. (2010), two important improvements to the task are re-balancing the composition of syntactic phenomena covered in the task dataset, and automating the entailment generation process. Both of these improvements will be made for the new PETE-2 dataset.&lt;br /&gt;&lt;br /&gt;Anyone will be welcome to submit a system to the shared task portion of the workshop, and reports on the shared task will make up part of the workshop program. For teams not wishing to build their own RTE system to interpret their parser output, we will ofter a simple system that generates RTE judgments from Stanford Dependency output, based on the top performing systems from SemEval-10 PETE.&lt;br /&gt;&lt;br /&gt;Part II: Papers. We invite full-length papers which present evaluation resources, tools, techniques, or ideas; results of new evaluations; or new methods for targeted parser training based on evaluation results. We welcome submissions on all related topics, including but not limited to:&lt;br /&gt;&lt;br /&gt;- new formalism-independent evaluation resources &lt;br /&gt;- new domain-specific or cross-domain evaluation resources &lt;br /&gt;- new language-specific or multi-lingual evaluation resources &lt;br /&gt;- new evaluation resources targeted to specific syntactic phenomena &lt;br /&gt;- new approches to identifying syntactic phenomena that are difficult for parsers &lt;br /&gt;- evaluation schemes that consider semantic relevance &lt;br /&gt;- new extrinsic or task-based evaluations &lt;br /&gt;- schemes for improvement of a parser based on evaluation results &lt;br /&gt;- evaluation techniques that consider inter-annotator agreement &lt;br /&gt;- ideas for bringing insights from other NLP subfields to bear on parser evaluation&lt;br /&gt;&lt;br /&gt;Desired Workshop Length: one day&lt;br /&gt;&lt;br /&gt;Estimated Number of Attendees: 25&lt;br /&gt;&lt;br /&gt;Organizers:&lt;br /&gt;&lt;br /&gt;Laura Rimell&lt;br /&gt;Computer Laboratory&lt;br /&gt;University of Cambridge&lt;br /&gt;William Gates Building&lt;br /&gt;15 JJ Thomson Ave&lt;br /&gt;Cambridge&lt;br /&gt;CB3 0FD&lt;br /&gt;United Kingdom&lt;br /&gt;+44 (0)1223 334696&lt;br /&gt;laura.rimell@cl.cam.ac.uk&lt;br /&gt;&lt;br /&gt;Statement of research interests and areas of expertise: Rimell has worked on domain adaptation for parsing and is interested in novel parser evaluation methods. She has worked on the evaluation of a variety of treebank, grammar-based, and dependency parsers on unbounded dependencies and was a contributor to the COLING-08 parser evaluation workshop as well as a member of the top-performing team in the SemEval-10 PETE task. She is also currently working on acquisition of lexical resources and has an interest in their relationship with parsing and parser evaluation.&lt;br /&gt;&lt;br /&gt;Deniz Yuret&lt;br /&gt;Department of Computer Engineering&lt;br /&gt;Koc University&lt;br /&gt;Istanbul&lt;br /&gt;Turkey&lt;br /&gt;+90-212-338-1724&lt;br /&gt;dyuret@ku.edu.tr&lt;br /&gt;&lt;br /&gt;Statement of research interests and areas of expertise: Yuret has worked on unsupervised parsing and various unsupervised disambiguation problems, including word senses, semantic relations, and morphology. He was the organizer of the SemEval-10 PETE task and is currently co-organizing the next SemEval.&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-3040782466397353132?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/3040782466397353132/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=3040782466397353132' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/3040782466397353132'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/3040782466397353132'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2010/11/next-generation-parser-evaluation.html' title='Next Generation Parser Evaluation'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-1380036996883165061</id><published>2010-11-15T17:46:00.004+02:00</published><updated>2010-11-15T18:03:53.484+02:00</updated><title type='text'>Computational Models of Narrative</title><content type='html'>AAAI 2010 Fall Symposium (&lt;a href="http://www.aaai.org/Library/Symposia/Fall/fs10-04.php"&gt;Proceedings&lt;/a&gt;), November 11-13, 2010, Arlington, VA.  &lt;a href="http://www.mit.edu/~markaf/"&gt;Mark Finlayson&lt;/a&gt;, Program Chair.&lt;br /&gt;&lt;br /&gt;Narratives are ubiquitous. We use them to educate, communicate, convince, explain, and entertain. As far as we know every society has narratives, which suggests they are deeply rooted and serve an important cognitive function: that narratives do something for us. It is clear that, to fully explain human intelligence, beliefs, and behaviors, we will have to understand and explain narrative.  &lt;span class="fullpost"&gt;&lt;br /&gt;&lt;br /&gt;The symposium will bring together researchers with a wide variety of perspectives to share what is known about the fundamentals of the computational modeling of narrative and to explore the forefront of that knowledge. We seek participation from as wide a variety of approaches as possible, including not only AI researchers and technologists, but also psychologists, cognitive scientists, linguists, philosophers, narrative theorists, anthropologists, educators, storytellers, and neuroscientists.&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-1380036996883165061?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='related' href='http://narrative.csail.mit.edu/fs10/' title='Computational Models of Narrative'/><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/1380036996883165061/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=1380036996883165061' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/1380036996883165061'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/1380036996883165061'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2010/11/computational-models-of-narrative.html' title='Computational Models of Narrative'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-111964857749320074</id><published>2010-11-03T20:41:00.005+02:00</published><updated>2011-03-17T23:34:27.226+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Math'/><title type='text'>Probability twisters</title><content type='html'>Of all the &lt;a href="http://denizyuret.blogspot.com/2005/06/deniz-yurets-math-problems.html"&gt;math problems&lt;/a&gt; I collect, these are my favorites. They do not require anything more than elementary math, but they do seem to trigger a software bug in most people's brains. Choose between several different arguments that lead to different answers for each problem.  (Updated Nov 2010: two siblings problem)&lt;span class="fullpost"&gt;&lt;br /&gt;&lt;ol&gt;&lt;li&gt; (Two siblings) If you pick a random family with two kids and calculate the probability of both being girls the obvious answer would be 1/4 (assuming girls and boys being equally likely).  However simple variations of this problem easily lead to some confusion: &lt;ul&gt;&lt;li&gt; Variation 1: If you ask the family whether they have at least one girl, and they say yes, the two girl probability is 1/3.&lt;br /&gt;&lt;/li&gt;&lt;li&gt; Variation 2: If you see one of their kids on the street and notice that she is a girl, the two girl probability is 1/2.&lt;br /&gt;&lt;/ul&gt; You can verify these answers by imagining the sample space of all (say four million) two-child families and assuming equal numbers of boy-boy, boy-girl, girl-boy, and girl-girl families (say one million each).  What is tricky to understand is why these two variations have different answers when it seems like they give you the exact same information.  Here are some more variations:  &lt;br /&gt;&lt;ul&gt;&lt;li&gt; Variation 3: If you learn that the older sibling is a girl, the two girl probability is 1/2.&lt;br /&gt;&lt;/li&gt;&lt;li&gt; Variation 4: If you learn that the family has one girl named Florida, the two girl probability is approximately 1/2.&lt;br /&gt;&lt;/li&gt;&lt;li&gt; Variation 5: If you learn that the family has one girl born on a Wednesday, the two girl probability is 13/27.&lt;br /&gt;&lt;/ul&gt;&lt;/li&gt;&lt;br /&gt;&lt;li&gt; (Umit - Monty Hall Problem) You are a participant on the game show "Let's Make a Deal." Monty Hall shows you three closed doors. He tells you that two of the closed doors have a goat behind them and that one of the doors has a new car behind it. You pick one door, but before you open it, Monty opens one of the two remaining doors and shows that it hides a goat. He then offers you a chance to switch doors with the remaining closed door. Is it to your advantage to do so?&lt;br /&gt;&lt;ul&gt;&lt;li&gt; Argument 1: It does not matter. The probability of finding the car in the remaining two doors was equal in the beginning, and they are still equal now. The fact that you put your hand on one of them cannot increase or decrease its probability of having the car under it.&lt;br /&gt;&lt;/li&gt;&lt;li&gt; Argument 2: If we repeated this experiment a million times, you would get the the car only one third of the time by sticking to your first door. People who consistently switch would win the other two thirds. Therefore you should switch.&lt;br /&gt;&lt;/li&gt;&lt;li&gt; Argument 3: Think about what you would do if there were a thousand doors, rather than three, and Monty Hall opened 998 doors with goats behind them.&lt;br /&gt;&lt;/li&gt;&lt;li&gt; Bibliography: &lt;a href="http://math.rice.edu/~pcmi/mathlinks/montyurl.html"&gt; http://math.rice.edu/~pcmi/mathlinks/montyurl.html&lt;/a&gt;&lt;br /&gt;&lt;/li&gt;&lt;/ul&gt;&lt;br /&gt;&lt;/li&gt;&lt;li&gt; (Encyclopedia of Bridge) You are South with three small of a suit, and dummy has QJ9. You desperately need a trick from this suit. You lead low to the Queen, and East wins with the King. When you get a second chance, you lead low to the J9 and West plays low. Should you play the Jack or the 9?&lt;br /&gt;&lt;ul&gt;&lt;li&gt; Argument 1: If either opponent has A10, it does not matter. If East has the Ace and West the 10, you want to play the 9. If it is the other way around, you want to play the Jack. Both sides are equally likely to have the Ace so it does not matter what you play. &lt;br /&gt;&lt;/li&gt;&lt;li&gt; Argument 2: You should play the Jack because East has the Ace only 1/3 of the time. If East had AK, he would play the King to the first trick only half the time. If he had K10, he would always play the King. Since we know he played the King, it is twice as likely that he has the K10 and not AK. &lt;br /&gt;&lt;/li&gt;&lt;li&gt; Note: Note the similarity with the Monty Hall problem.&lt;br /&gt;&lt;/li&gt;&lt;/ul&gt;&lt;br /&gt;&lt;/li&gt;&lt;li&gt; (Memduh - Two envelope problem) I offer you a pick between two envelopes with money. One envelope has twice as much money as the other. You pick one, and out comes 10 dollars. Now I give you a chance to switch. Would you like to switch? How much are you willing to pay to switch?&lt;br /&gt;&lt;ul&gt;&lt;li&gt; Argument 1: Of course you switch. The expected amount of money in the other envelope is 0.5x5 + 0.5x20 = 12.5 dollars. In fact you are willing to pay up to 2.5 dollars to switch. &lt;br /&gt;&lt;/li&gt;&lt;li&gt; Argument 2: What if I asked you the question before you opened the envelope and saw the 10 dollars? Using the same reasoning, you can assume there is A dollars in the envelope and compute 0.5x(A/2) + 0.5x(2A) = 1.25A for the expected money in the other envelope. So you would switch. Just before you open your new envelope, I ask you whether you would like to switch again? What would your answer be? &lt;br /&gt;&lt;/li&gt;&lt;li&gt; Note: In fact if I can find two people that believe in Argument 1, I can build a money machine. Just keep giving them two envelopes with 5 and 10 dollars and charge for switching... :^) (Of course I charge them whatever comes out of the first envelope for playing the game, so that it is a zero sum game.) &lt;br /&gt;&lt;/li&gt;&lt;li&gt; Bibliography: &lt;a href="http://www.u.arizona.edu/~chalmers/papers/envelope.html"&gt; http://www.u.arizona.edu/~chalmers/papers/envelope.html&lt;/a&gt; &lt;/li&gt;&lt;/ul&gt;&lt;br /&gt;&lt;/li&gt;&lt;li&gt; (Neal) I pick two real numbers. You look at one of them. Can you find a strategy that lets you guess whether you are looking at the larger or smaller number with more than 1/2 success rate.&lt;br /&gt;&lt;ul&gt;&lt;li&gt; Argument 1: Obviously you cannot find such a strategy. &lt;br /&gt;&lt;/li&gt;&lt;li&gt; Argument 2: Take a probability distribution that is non-zero over all the real numbers (standard normal for example). Draw a random number from this distribution and respond assuming that the hidden number is equal to your random number. There are three cases: (i) Your random number will be smaller than both my numbers, in which case you have 50% chance of winning. (ii) Your random number will be larger than both my numbers, in which case you have 50% chance of winning. (iii) Your random number will be between my two numbers, in which case you have 100% chance of winning. The average is greater than 50%. &lt;br /&gt;&lt;/li&gt;&lt;li&gt; Note: Using a similar argument one can show that you could in fact make a profit in the two envelope problem by employing a mixed strategy. &lt;br /&gt;&lt;/li&gt;&lt;/ul&gt;&lt;br /&gt;&lt;/li&gt;&lt;li&gt; (Alkan) You draw a random line that cuts a circle with unit radius. What is the probability that the chord will be longer than sqrt(3)?&lt;br /&gt;&lt;ul&gt;&lt;li&gt; Argument 1: Consider the distance between the midpoint of the chord and the center of the circle. If this distance is less than 1/2 the chord will be longer than sqrt(3). Therefore the answer is 1/2.&lt;br /&gt;&lt;/li&gt;&lt;li&gt; Argument 2: Draw a tangent at one of the points the line intersects the circle. Consider the angle between this tangent and the chord. If this angle is between 60 and 120 degrees, the chord will be longer than sqrt(3). Therefore the answer is 1/3. &lt;br /&gt;&lt;/li&gt;&lt;li&gt; Argument 3: Consider the midpoint of the chord. If this midpoint is within a concentric circle with half the radius, the chord will be longer than sqrt(3). The area of a circle with half the radius is 1/4th of the original. Therefore the answer is 1/4. &lt;/li&gt;&lt;/ul&gt;&lt;/li&gt;&lt;/ol&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-111964857749320074?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/111964857749320074/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=111964857749320074' title='4 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/111964857749320074'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/111964857749320074'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2005/06/probability-twisters.html' title='Probability twisters'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>4</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-793742780993197918</id><published>2010-10-27T14:10:00.004+03:00</published><updated>2011-09-22T19:07:47.519+03:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Publications'/><title type='text'>Anharmonicity, mode-coupling and entropy in a fluctuating native protein</title><content type='html'>A. Kabakçıoğlu, D. Yuret, M. Gür, B. Erman. 2010 Phys. Biol. 7 046005 doi: &lt;a href="http://dx.doi.org/10.1088/1478-3975/7/4/046005"&gt;10.1088/1478-3975/7/4/046005&lt;/a&gt; (&lt;a href="https://docs.google.com/viewer?a=v&amp;pid=explorer&amp;chrome=true&amp;srcid=0B6C4-zOYlkxsMjRjZTI2YmUtNjFiMC00YWY0LThjNTQtYTM3NmVmOGMyYWUz&amp;hl=en"&gt;PDF&lt;/a&gt;, &lt;a href="http://iopscience.iop.org/1478-3975/7/4/046005/pdf/1478-3975_7_4_046005.pdf"&gt;PDF&lt;/a&gt;, &lt;a href="http://iopscience.iop.org/1478-3975/7/4/046005/fulltext"&gt;HTML&lt;/a&gt;, &lt;a href="http://stacks.iop.org/1478-3975/7/046005"&gt;Online&lt;/a&gt;, &lt;a href="https://docs.google.com/uc?id=0B6C4-zOYlkxsZjg1Mzg0MzYtYmRjYi00YWE0LTlkN2YtNTA5MDk4OTYyYjA4&amp;export=download&amp;hl=en"&gt;Hermite code&lt;/a&gt;)&lt;br /&gt;&lt;span class="fullpost"&gt;&lt;br /&gt;&lt;b&gt;Abstract:&lt;/b&gt;  We develop a general framework for the analysis of residue fluctuations that simultaneously incorporates anharmonicity and mode-coupling in a unified formalism. We show that both deviations from the Gaussian model are important for modeling the multidimensional energy landscape of the protein Crambin (1EJG) in the vicinity of its native state. The effect of anharmonicity and mode-coupling on the fluctuational entropy is in the order of a few percent.&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-793742780993197918?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='related' href='http://stacks.iop.org/1478-3975/7/046005' title='Anharmonicity, mode-coupling and entropy in a fluctuating native protein'/><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/793742780993197918/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=793742780993197918' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/793742780993197918'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/793742780993197918'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2010/10/anharmonicity-mode-coupling-and-entropy.html' title='Anharmonicity, mode-coupling and entropy in a fluctuating native protein'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-3661954698991679489</id><published>2010-10-19T09:04:00.020+03:00</published><updated>2010-11-03T09:08:54.386+02:00</updated><title type='text'>Istanbul Marathon 2010</title><content type='html'>&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://1.bp.blogspot.com/_306eXDjZh7g/TL03B2wqSHI/AAAAAAAAAHc/bknBJIagtpw/s1600/marathon.jpg"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 400px; height: 45px;" src="http://1.bp.blogspot.com/_306eXDjZh7g/TL03B2wqSHI/AAAAAAAAAHc/bknBJIagtpw/s400/marathon.jpg" border="0" alt="" id="BLOGGER_PHOTO_ID_5529636422499846258" /&gt;&lt;/a&gt;Some pictures from the Istanbul Marathon...&lt;span class="fullpost"&gt;&lt;br /&gt;&lt;br /&gt;Here is our route from &lt;a href="http://runkeeper.com/user/denizyuret/activity/18549720"&gt;runkeeper.com&lt;/a&gt;:&lt;br /&gt;&lt;br /&gt;&lt;iframe width="425" height="345" src="http://runkeeper.com/activityMap/b1l20"&gt;&lt;/iframe&gt;&lt;br /&gt;&lt;br /&gt;Some useful links:&lt;br /&gt;&lt;a href="http://evolutionrunning.com/"&gt;evolutionrunning.com&lt;/a&gt;: How to run without injury.&lt;br /&gt;&lt;a href="http://www.amazon.com/Marathon-You-Can-Jeff-Galloway/dp/093607048X"&gt;Galloway's Marathon book&lt;/a&gt;: Everybody can do it!&lt;br /&gt;&lt;a href="http://www.amazon.com/Born-Run-Hidden-Superathletes-Greatest/dp/0307266303"&gt;Born to run&lt;/a&gt;: To get inspired.&lt;br /&gt;&lt;br /&gt;And some pictures:&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://1.bp.blogspot.com/_306eXDjZh7g/TL1ki-l-FLI/AAAAAAAAAMM/VvgKbALnIh4/s1600/photo+(31)_2.jpg"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 320px; height: 240px;" src="http://1.bp.blogspot.com/_306eXDjZh7g/TL1ki-l-FLI/AAAAAAAAAMM/VvgKbALnIh4/s320/photo+(31)_2.jpg" border="0" alt=""id="BLOGGER_PHOTO_ID_5529686469561423026" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://1.bp.blogspot.com/_306eXDjZh7g/TL1kazluxfI/AAAAAAAAAL0/v_K6bCeL_LQ/s1600/photo-8_2.jpg"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 240px; height: 320px;" src="http://1.bp.blogspot.com/_306eXDjZh7g/TL1kazluxfI/AAAAAAAAAL0/v_K6bCeL_LQ/s320/photo-8_2.jpg" border="0" alt=""id="BLOGGER_PHOTO_ID_5529686329168676338" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://1.bp.blogspot.com/_306eXDjZh7g/TL1kKO31WMI/AAAAAAAAALM/SsGQy-yAI4Q/s1600/photo-9_2.jpg"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 320px; height: 240px;" src="http://1.bp.blogspot.com/_306eXDjZh7g/TL1kKO31WMI/AAAAAAAAALM/SsGQy-yAI4Q/s320/photo-9_2.jpg" border="0" alt=""id="BLOGGER_PHOTO_ID_5529686044434585794" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://3.bp.blogspot.com/_306eXDjZh7g/TL1kJZiqIYI/AAAAAAAAAK8/73eQPzj7idA/s1600/photo-10_2.jpg"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 320px; height: 240px;" src="http://3.bp.blogspot.com/_306eXDjZh7g/TL1kJZiqIYI/AAAAAAAAAK8/73eQPzj7idA/s320/photo-10_2.jpg" border="0" alt=""id="BLOGGER_PHOTO_ID_5529686030118691202" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://1.bp.blogspot.com/_306eXDjZh7g/TL1j2X7EADI/AAAAAAAAAKM/s5F_N7XVadQ/s1600/photo+(32)_2.jpg"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 320px; height: 240px;" src="http://1.bp.blogspot.com/_306eXDjZh7g/TL1j2X7EADI/AAAAAAAAAKM/s5F_N7XVadQ/s320/photo+(32)_2.jpg" border="0" alt=""id="BLOGGER_PHOTO_ID_5529685703266664498" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://4.bp.blogspot.com/_306eXDjZh7g/TL1jmozSr9I/AAAAAAAAAJs/chH1Fgm3-vI/s1600/photo-2_2.jpg"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 320px; height: 240px;" src="http://4.bp.blogspot.com/_306eXDjZh7g/TL1jmozSr9I/AAAAAAAAAJs/chH1Fgm3-vI/s320/photo-2_2.jpg" border="0" alt=""id="BLOGGER_PHOTO_ID_5529685432919568338" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://4.bp.blogspot.com/_306eXDjZh7g/TL1kaRmjcdI/AAAAAAAAALc/11hayLBnZG0/s1600/photo+(18)_2.jpg"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 240px; height: 320px;" src="http://4.bp.blogspot.com/_306eXDjZh7g/TL1kaRmjcdI/AAAAAAAAALc/11hayLBnZG0/s320/photo+(18)_2.jpg" border="0" alt=""id="BLOGGER_PHOTO_ID_5529686320045322706" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://2.bp.blogspot.com/_306eXDjZh7g/TL1kJyr3hqI/AAAAAAAAALE/pxIvl5U9P1Y/s1600/photo+(19)_2.jpg"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 320px; height: 240px;" src="http://2.bp.blogspot.com/_306eXDjZh7g/TL1kJyr3hqI/AAAAAAAAALE/pxIvl5U9P1Y/s320/photo+(19)_2.jpg" border="0" alt=""id="BLOGGER_PHOTO_ID_5529686036868204194" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://3.bp.blogspot.com/_306eXDjZh7g/TL1j2XvuUjI/AAAAAAAAAKU/8nEML7BWRCc/s1600/photo+(20)_2.jpg"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 320px; height: 240px;" src="http://3.bp.blogspot.com/_306eXDjZh7g/TL1j2XvuUjI/AAAAAAAAAKU/8nEML7BWRCc/s320/photo+(20)_2.jpg" border="0" alt=""id="BLOGGER_PHOTO_ID_5529685703219106354" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://4.bp.blogspot.com/_306eXDjZh7g/TL1jnd67rII/AAAAAAAAAJ8/ZoKrGzTZX-I/s1600/photo+(21)_2.jpg"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 320px; height: 240px;" src="http://4.bp.blogspot.com/_306eXDjZh7g/TL1jnd67rII/AAAAAAAAAJ8/ZoKrGzTZX-I/s320/photo+(21)_2.jpg" border="0" alt=""id="BLOGGER_PHOTO_ID_5529685447178693762" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://1.bp.blogspot.com/_306eXDjZh7g/TL1j3BrBVUI/AAAAAAAAAKk/hLCl5NUCCS8/s1600/photo-7_2.jpg"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 320px; height: 240px;" src="http://1.bp.blogspot.com/_306eXDjZh7g/TL1j3BrBVUI/AAAAAAAAAKk/hLCl5NUCCS8/s320/photo-7_2.jpg" border="0" alt=""id="BLOGGER_PHOTO_ID_5529685714473669954" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://2.bp.blogspot.com/_306eXDjZh7g/TL1j2t9QkjI/AAAAAAAAAKc/ekTlUTf3fhk/s1600/photo+(30)_2.jpg"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 320px; height: 240px;" src="http://2.bp.blogspot.com/_306eXDjZh7g/TL1j2t9QkjI/AAAAAAAAAKc/ekTlUTf3fhk/s320/photo+(30)_2.jpg" border="0" alt=""id="BLOGGER_PHOTO_ID_5529685709181456946" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://4.bp.blogspot.com/_306eXDjZh7g/TL1jnURDZNI/AAAAAAAAAKE/zoKdpdwjeLE/s1600/photo-6_2.jpg"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 320px; height: 240px;" src="http://4.bp.blogspot.com/_306eXDjZh7g/TL1jnURDZNI/AAAAAAAAAKE/zoKdpdwjeLE/s320/photo-6_2.jpg" border="0" alt=""id="BLOGGER_PHOTO_ID_5529685444587119826" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://3.bp.blogspot.com/_306eXDjZh7g/TL1jl9aQckI/AAAAAAAAAJk/Gmbhjy-WdLc/s1600/photo+(27)_2.jpg"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 320px; height: 240px;" src="http://3.bp.blogspot.com/_306eXDjZh7g/TL1jl9aQckI/AAAAAAAAAJk/Gmbhjy-WdLc/s320/photo+(27)_2.jpg" border="0" alt=""id="BLOGGER_PHOTO_ID_5529685421271839298" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://1.bp.blogspot.com/_306eXDjZh7g/TL1kKfOQ18I/AAAAAAAAALU/3fbiLpdyOKY/s1600/photo+(29)_2.jpg"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 320px; height: 240px;" src="http://1.bp.blogspot.com/_306eXDjZh7g/TL1kKfOQ18I/AAAAAAAAALU/3fbiLpdyOKY/s320/photo+(29)_2.jpg" border="0" alt=""id="BLOGGER_PHOTO_ID_5529686048823629762" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://2.bp.blogspot.com/_306eXDjZh7g/TL1kbNbuXKI/AAAAAAAAAL8/9bQTjmtelbI/s1600/photo+(28)_2.jpg"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 320px; height: 240px;" src="http://2.bp.blogspot.com/_306eXDjZh7g/TL1kbNbuXKI/AAAAAAAAAL8/9bQTjmtelbI/s320/photo+(28)_2.jpg" border="0" alt=""id="BLOGGER_PHOTO_ID_5529686336106028194" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://1.bp.blogspot.com/_306eXDjZh7g/TL1kI8rCGMI/AAAAAAAAAK0/mUZ7PCew-V4/s1600/photo-4_2.jpg"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 320px; height: 240px;" src="http://1.bp.blogspot.com/_306eXDjZh7g/TL1kI8rCGMI/AAAAAAAAAK0/mUZ7PCew-V4/s320/photo-4_2.jpg" border="0" alt=""id="BLOGGER_PHOTO_ID_5529686022369188034" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://2.bp.blogspot.com/_306eXDjZh7g/TL1jmj18a5I/AAAAAAAAAJ0/JCUEIMtT0RE/s1600/photo-5_2.jpg"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 320px; height: 240px;" src="http://2.bp.blogspot.com/_306eXDjZh7g/TL1jmj18a5I/AAAAAAAAAJ0/JCUEIMtT0RE/s320/photo-5_2.jpg" border="0" alt=""id="BLOGGER_PHOTO_ID_5529685431588514706" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://2.bp.blogspot.com/_306eXDjZh7g/TL1kaqYUTjI/AAAAAAAAALk/8vX5xSUVEC4/s1600/photo-1_2.jpg"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 320px; height: 240px;" src="http://2.bp.blogspot.com/_306eXDjZh7g/TL1kaqYUTjI/AAAAAAAAALk/8vX5xSUVEC4/s320/photo-1_2.jpg" border="0" alt=""id="BLOGGER_PHOTO_ID_5529686326696496690" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://3.bp.blogspot.com/_306eXDjZh7g/TL1kilyV56I/AAAAAAAAAME/giDg_PPthwc/s1600/photo-3_2.jpg"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 320px; height: 240px;" src="http://3.bp.blogspot.com/_306eXDjZh7g/TL1kilyV56I/AAAAAAAAAME/giDg_PPthwc/s320/photo-3_2.jpg" border="0" alt=""id="BLOGGER_PHOTO_ID_5529686462902429602" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://1.bp.blogspot.com/_306eXDjZh7g/TL1j3QlI9dI/AAAAAAAAAKs/qWDrDiQePuI/s1600/photo+(25)_2.jpg"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 320px; height: 240px;" src="http://1.bp.blogspot.com/_306eXDjZh7g/TL1j3QlI9dI/AAAAAAAAAKs/qWDrDiQePuI/s320/photo+(25)_2.jpg" border="0" alt=""id="BLOGGER_PHOTO_ID_5529685718475535826" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://1.bp.blogspot.com/_306eXDjZh7g/TL1kaiynt9I/AAAAAAAAALs/N99iemiDBNU/s1600/photo+(24)_2.jpg"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 320px; height: 240px;" src="http://1.bp.blogspot.com/_306eXDjZh7g/TL1kaiynt9I/AAAAAAAAALs/N99iemiDBNU/s320/photo+(24)_2.jpg" border="0" alt=""id="BLOGGER_PHOTO_ID_5529686324659337170" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-3661954698991679489?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='related' href='http://www.istanbulmarathon.org' title='Istanbul Marathon 2010'/><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/3661954698991679489/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=3661954698991679489' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/3661954698991679489'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/3661954698991679489'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2010/10/istanbul-marathon-2010.html' title='Istanbul Marathon 2010'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://1.bp.blogspot.com/_306eXDjZh7g/TL03B2wqSHI/AAAAAAAAAHc/bknBJIagtpw/s72-c/marathon.jpg' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-4675572702915549780</id><published>2010-09-19T12:53:00.005+03:00</published><updated>2010-11-04T07:48:54.488+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Books'/><title type='text'>Adam's Tongue by Derek Bickerton</title><content type='html'>Bickerton presents his theory on the origin of language.  Some notes: &lt;span class="fullpost"&gt;&lt;br /&gt;&lt;br /&gt;- Human language is qualitatively distinct from animal communication systems (ACSs): most words are incomplete by themselves and need to be combined to express meaning whereas animal calls are self sufficient.&lt;br /&gt;&lt;br /&gt;- A better model for origins of language may be pidgins (languages created by people that live together but do not share a common language).&lt;br /&gt;&lt;br /&gt;- A protolanguage would have words as we know them that combine but no well defined rules of syntax or morphology.&lt;br /&gt;&lt;br /&gt;- ACSs are all about the here and now, most words refer to things outside the current happenings.&lt;br /&gt;&lt;br /&gt;- ACSs are mainly manipulative whereas language mainly informative.&lt;br /&gt;&lt;br /&gt;- Communicative units come in three flavors: icons resemble the thing talked about, indices point to them, and symbols do neither. (function words which do not refer at all form a separate group).  Displacement (referring to things that are not in the here/now) is only possible with symbols and icons.  Iconic signs may have been the first displaced  ones paving the way for symbols.&lt;br /&gt;&lt;br /&gt;- "Since we usually regard language as no more than the means by which we express our thoughts, it seems natural to think that language should issue from intelligence, rather than vice versa.  It seemed equally obvious, to naive observers, that the earth was the center of the universe, and the sun, moon, and planets all went around it." pp.58&lt;br /&gt;&lt;br /&gt;- Categories are different from concepts.  pp.87. Concepts you can think about or think with, whereas all you can do with categories is to tell whether something belongs in them.  pp.205&lt;br /&gt;&lt;br /&gt;- The ACSs of ants and bees may be closer to humans because they exhibit displacement (about food sources distant in time and space). ch.7&lt;br /&gt;&lt;br /&gt;- Crucial as speciation is, it's still far from completely understood.  pp.149&lt;br /&gt;&lt;br /&gt;- The real breakthrough in language had to be displacement rather than arbitrariness (of signs).  pp.160&lt;br /&gt;&lt;br /&gt;- Bickerton's sequence:&lt;br /&gt;1. Animals have concepts that won't merge.&lt;br /&gt;2. Protohumans start talking.&lt;br /&gt;3. Talking produces typically human concepts.&lt;br /&gt;4. Merge appears and starts merging concepts.&lt;br /&gt;5. The brain maybe gets rewired.&lt;br /&gt;6. Capacities for complex thought planning etc. develop.&lt;br /&gt;&lt;br /&gt;Chomsky's version has concepts appear first and talking last.  pp.189.&lt;br /&gt;&lt;br /&gt;- Thoughts like "roses are red" are offline as opposed to online thinking about here and now.  (still makes the hearer do stuff: think of roses, think of red, merge...) pp.193&lt;br /&gt;&lt;br /&gt;- Categories have to be only detailed enough to distinguish between appropriate reactions (affordances?). pp.206.&lt;br /&gt;&lt;br /&gt;- I don't quite know what he's saying about memory pp.207 or recursion pp.244.&lt;br /&gt;&lt;br /&gt;- Posted using BlogPress from my iPhone&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-4675572702915549780?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='related' href='http://books.google.com/books?id=s7UqXppMR8QC&amp;dq=adam&apos;s+tongue&amp;hl=en&amp;ei=_0jSTI3pFM6cOq-NofMO&amp;sa=X&amp;oi=book_result&amp;ct=result&amp;resnum=1&amp;ved=0CCUQ6AEwAA' title='Adam&amp;#39;s Tongue by Derek Bickerton'/><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/4675572702915549780/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=4675572702915549780' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/4675572702915549780'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/4675572702915549780'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2010/09/adam-tongue-by-derek-bickerton.html' title='Adam&amp;#39;s Tongue by Derek Bickerton'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-2322871043277903898</id><published>2010-09-04T17:58:00.006+03:00</published><updated>2010-11-03T09:08:54.388+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Books'/><category scheme='http://www.blogger.com/atom/ns#' term='Links'/><title type='text'>Author Alerts</title><content type='html'>I have looked for a way to get alerted about new releases from my favorite authors for a long time.  I think Amazon used to support this in the past but they no longer do.  Barnes and Noble has writer alerts for only a small list of authors.  This is such an obvious feature for a bibliophile that I do not understand why nobody supports it.  I was about to write my own code but luckily ran into &lt;a href="http://www.authoralerts.com"&gt;www.authoralerts.com&lt;/a&gt; first.  Highly recommended.&lt;br /&gt;&lt;span class="fullpost"&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-2322871043277903898?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='related' href='http://www.authoralerts.com' title='Author Alerts'/><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/2322871043277903898/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=2322871043277903898' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/2322871043277903898'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/2322871043277903898'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2010/09/author-alerts.html' title='Author Alerts'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-5924358281113777728</id><published>2010-08-26T11:05:00.002+03:00</published><updated>2011-05-02T11:32:45.394+03:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Publications'/><title type='text'>Unsupervised Part of Speech Tagging Using Unambiguous Substitutes from a Statistical Language Model</title><content type='html'>Mehmet Ali Yatbaz and Deniz Yuret.  Coling 2010.  pp. 1391--1398.  Beijing, China.  (&lt;a href="http://aclweb.org/anthology-new/C/C10/C10-2159.pdf"&gt;PDF&lt;/a&gt;, &lt;a href="https://docs.google.com/uc?export=download&amp;id=0B6C4-zOYlkxsZWZlNzVkMWYtNjA0Ny00ZWNlLTk0NDUtNmYzYmExOGFmYzNh"&gt;Poster&lt;/a&gt;)&lt;span class="fullpost"&gt;&lt;br /&gt;&lt;b&gt;Abstract:&lt;/b&gt; We show that unsupervised part of speech tagging performance can be significantly improved using likely substitutes for target words given by a statistical language model.  We choose unambiguous substitutes for each occurrence of an ambiguous target word based on its context.  The part of speech tags for the unambiguous substitutes are then used to filter the entry for the target word in the word--tag dictionary. A standard HMM model trained using the filtered dictionary achieves 92.25% accuracy on a standard 24,000 word corpus.&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-5924358281113777728?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='related' href='http://aclweb.org/anthology-new/C/C10/C10-2159.pdf' title='Unsupervised Part of Speech Tagging Using Unambiguous Substitutes from a Statistical Language Model'/><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/5924358281113777728/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=5924358281113777728' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/5924358281113777728'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/5924358281113777728'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2010/08/unsupervised-part-of-speech-tagging.html' title='Unsupervised Part of Speech Tagging Using Unambiguous Substitutes from a Statistical Language Model'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-3453161494711021079</id><published>2010-07-15T21:06:00.004+03:00</published><updated>2011-05-02T11:29:35.093+03:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Publications'/><title type='text'>SemEval-2010 Task 12: Parser Evaluation using Textual Entailments</title><content type='html'>Deniz Yuret, Aydın Han, Zehra Turgut.  &lt;i&gt;Proceedings of the 5th International Workshop on Semantic Evaluation.  (&lt;a href="http://semeval2.fbk.eu/semeval2.php"&gt;SemEval-2010&lt;/a&gt;)&lt;/i&gt;  pp. 51--56.  July, 2010.  Uppsala, Sweden.  (&lt;a href="http://aclweb.org/anthology-new/S/S10/S10-1009.pdf"&gt;PDF&lt;/a&gt;, &lt;a href="http://docs.google.com/present/view?id=dc7kdh7w_31fbpc85fb"&gt;Presentation&lt;/a&gt;, &lt;a href="https://sites.google.com/a/yuret.com/pete"&gt;Task website&lt;/a&gt;, &lt;a href="http://aclweb.org/anthology-new/S/S10"&gt;Proceedings&lt;/a&gt;, &lt;a href="https://docs.google.com/viewer?a=v&amp;pid=explorer&amp;chrome=true&amp;srcid=0B6C4-zOYlkxsZDk3ZGQ4ZGItODNiYS00MzA1LWI2YjctMDIwYzdiYjM1ZDhj&amp;hl=en"&gt;Journal submission&lt;/a&gt;).&lt;br /&gt;&lt;span class="fullpost"&gt;&lt;br /&gt;&lt;iframe src="http://docs.google.com/present/embed?id=dc7kdh7w_31fbpc85fb" frameborder="0" width="410" height="342"&gt;&lt;/iframe&gt;&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Abstract:&lt;/b&gt; Parser Evaluation using Textual Entailments (PETE) is a shared task in the SemEval-2010 Evaluation Exercises on Semantic Evaluation.  The task involves recognizing textual entailments based on syntactic information alone.  PETE introduces a new parser evaluation scheme that is formalism independent, less prone to annotation error, and focused on semantically relevant distinctions.&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-3453161494711021079?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='related' href='http://aclweb.org/anthology-new/S/S10/S10-1009.pdf' title='SemEval-2010 Task 12: Parser Evaluation using Textual Entailments'/><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/3453161494711021079/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=3453161494711021079' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/3453161494711021079'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/3453161494711021079'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2010/07/semeval-2010-task-12-parser-evaluation.html' title='SemEval-2010 Task 12: Parser Evaluation using Textual Entailments'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-5865005802117095396</id><published>2010-07-15T15:39:00.002+03:00</published><updated>2010-11-03T09:08:54.390+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Publications'/><title type='text'>L1 Regularized Regression for Reranking and System Combination in Machine Translation</title><content type='html'>Ergun Bicici, Deniz Yuret.   &lt;i&gt;Proceedings of the Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR.&lt;/i&gt;  pp. 282--289.  July 2010.  Uppsala, Sweden.  (&lt;a href="http://aclweb.org/anthology-new/W/W10/W10-1741.pdf"&gt;PDF&lt;/a&gt;, &lt;a href="https://docs.google.com/fileview?id=0B6C4-zOYlkxsZTBlMGM4N2QtOGExYS00YTdiLTk1MmQtM2QwZTY3MTVlM2Y0&amp;hl=en"&gt;Slide&lt;/a&gt;, &lt;a href="https://docs.google.com/fileview?id=0B6C4-zOYlkxsYWZmMWYwMmItYmYyNC00ZWI1LWJjZTEtYjY5MmQ0NjcyZGUz&amp;hl=en"&gt;Poster&lt;/a&gt;)&lt;br /&gt;&lt;span class="fullpost"&gt;&lt;br /&gt;&lt;b&gt;Abstract:&lt;/b&gt;  We use L1 regularized transductive regression to learn mappings between source and target features of the training sets derived for each test sentence and use these mappings to rerank translation outputs.  We compare the effectiveness of L1 regularization techniques for regression to learn mappings between features given in a sparse feature matrix.  The results show the effectiveness of using L1 regularization versus L2 used in ridge regression.  We show that regression mapping is effective in reranking translation outputs and in selecting the best system combinations with encouraging results on different language pairs.&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-5865005802117095396?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='related' href='http://aclweb.org/anthology-new/W/W10/W10-1741.pdf' title='L1 Regularized Regression for Reranking and System Combination in Machine Translation'/><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/5865005802117095396/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=5865005802117095396' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/5865005802117095396'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/5865005802117095396'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2010/07/l1-regularized-regression-for-reranking.html' title='L1 Regularized Regression for Reranking and System Combination in Machine Translation'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-3151342673915320817</id><published>2010-04-22T11:32:00.012+03:00</published><updated>2010-11-03T09:08:54.391+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Books'/><title type='text'>The Trouble with Physics by Lee Smolin</title><content type='html'>Smolin's book made me think of a dilemma I face often.  I find the current system of scientific funding disturbing.  Chief among the "values" of a scientist is absolute honesty.  Yet, the project proposals we need to fill periodically ask us to describe what we are going to do in detail for the next three years.  I don't know what I am going to do during the next three weeks!  It depends on what results I am going to get using my current approach during the next couple of days.  Maybe I will have a brilliant idea that will change my whole approach to the problem.  Maybe I will be taken over with another problem.  Honestly I don't know.  The only thing I can promise is that I will put all my working energy on making progress on the problem that I find most promising at the time.  But apparently that is not enough to get funding, and we are forced to either (i) bend the truth, or (ii) tie ourselves to an approach that we will most likely find suboptimal in the near future. &lt;span class="fullpost"&gt;&lt;br /&gt;&lt;br /&gt;To me the answer is simple: scientists should be funded not on promises about the future (which nobody can honestly make, let alone scientists whose job is to explore the unknown), but on past performance.  That leaves the problem of young scientists who have no past.  There should be a reasonable amount of seed funding for such people, just enough to make sure an adventurous spirit has enough time to risk his career tackling an important and deep problem.&lt;br /&gt;&lt;br /&gt;Smolin's book should be required reading by all who manage scientists and scientific funding.  If you are not interested in the debate on string theory, just read the last few chapters on how science works based on a shared ethic, and why we should take a bit more risk on "Seers" who tend to obsess about high risk problems and may take a long time (sometimes forever) producing anything valuable. &lt;br /&gt;&lt;br /&gt;Chapter 17 proposes the shared ethic among scientists rather than some abstract "scientific method" as chiefly responsible for the success of science.  Chapter 18 draws a distinction between two types of scientists "Seers" and "Craftspeople".  In fact pretty much the whole book is an elaboration of how and why the scientific establishment does not provide enough room for "Seers" who by nature like to obsess about high risk problems and need much longer incubation times.&lt;br /&gt;&lt;br /&gt;I find the shared ethic of science to be one of the most important creations of human culture.  I had long held the view that science was about "what is" and not about "what ought to be", thus science and ethics had nothing to do each other.  Recently I started to see the ethic of scientists as people, if not the result of their work, as being very relevant.  &lt;a href="http://www.nytimes.com/2009/01/27/science/27essa.html?_r=1"&gt;Dennis Overbye&lt;/a&gt; describes it best:&lt;br /&gt;&lt;br /&gt;"Not only does science not provide any values of its own, say its detractors, it also undermines the ones we already have, devaluing anything it can’t measure, reducing sunsets to wavelengths and romance to jiggly hormones. It destroys myths and robs the universe of its magic and mystery.  So the story goes.  But this is balderdash. Science is not a monument of received Truth but something that people do to look for truth.  That endeavor, which has transformed the world in the last few centuries, does indeed teach values. Those values, among others, are honesty, doubt, respect for evidence, openness, accountability and tolerance and indeed hunger for opposing points of view."&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-3151342673915320817?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='related' href='http://www.thetroublewithphysics.com/' title='The Trouble with Physics by Lee Smolin'/><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/3151342673915320817/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=3151342673915320817' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/3151342673915320817'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/3151342673915320817'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2010/04/trouble-with-physics-by-lee-smolin.html' title='The Trouble with Physics by Lee Smolin'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-3361565282568144099</id><published>2010-02-23T17:51:00.003+02:00</published><updated>2010-11-03T08:24:49.991+02:00</updated><title type='text'>Biçimbilimsel Çözümleme</title><content type='html'>*********************************************&lt;br /&gt;KOC UNIVERSITY&lt;br /&gt;ELECTRICAL AND COMPUTER ENGINEERING&lt;br /&gt;ECOE 590 SEMINAR&lt;br /&gt;*********************************************&lt;br /&gt;&lt;br /&gt;Date : 23 February 2010, Tuesday&lt;br /&gt;Time : 17:00&lt;br /&gt;Place : ENG B29&lt;br /&gt;Title : Biçimbilimsel Çözümleme&lt;br /&gt;Speaker : Gülşen Cebiroğlu Eryiğit&lt;br /&gt;Download : &lt;a href="https://docs.google.com/fileview?id=0B6C4-zOYlkxsZjA2NWNiMWEtNDVlYS00NWE4LTllNTAtMWRkYzdlZjZlNjIx&amp;hl=en"&gt;PDF&lt;/a&gt;&lt;br /&gt;&lt;span class="fullpost"&gt;&lt;br /&gt;Doğal Dil İşleme’nin en temel seviyelerinden biri olan biçimbilimsel çözümleme, bir sözcüğün yapısının bilgisayarlar tarafından otomatik olarak çözümlenmesi işlemidir. Biçimbilimsel çözümleme işlemi sonucunda bir sözcüğün en küçük anlamlı birimleri olan morfemlerin (biçimbirimlerin) bulunması ve sözcük yapısının çözümlenmesi hedeflenmektedir. Örneğin “arabalar” sözcüğünün gövdesinin “araba” olduğu ve bu sözcüğün çoğul eki almış bir isim olduğunun otomatik olarak belirlenmesi bir biçimbilimsel çözümleme işlemidir. İşlem sırasında, sözcüğü oluşturan morfemlerin birbirlerinden ayrılmasından yola çıkılarak, bu işleme aynı zamanda Biçimbilimsel Ayrıştırma adı da verilmektedir.&lt;br /&gt;Bu konuşmada, biçimbilimsel çözümleme konusu ayrıntılı olarak ele alınacak, kullanım alanları, genel yaklaşımlar ve Türkçe'nin biçimbilimsel çözümlemesi konusunda bilgi verilecektir.&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-3361565282568144099?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='related' href='https://docs.google.com/fileview?id=0B6C4-zOYlkxsZjA2NWNiMWEtNDVlYS00NWE4LTllNTAtMWRkYzdlZjZlNjIx&amp;hl=en' title='Biçimbilimsel Çözümleme'/><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/3361565282568144099/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=3361565282568144099' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/3361565282568144099'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/3361565282568144099'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2010/02/bicimbilimsel-cozumleme.html' title='Biçimbilimsel Çözümleme'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-8662638035923847732</id><published>2010-02-22T23:13:00.008+02:00</published><updated>2010-11-03T09:08:54.392+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Notes'/><title type='text'>CFP: Parser Evaluation using Textual Entailments (PETE)</title><content type='html'>SemEval-2010 Shared Task #12&lt;br /&gt;Parser Evaluation using Textual Entailments (PETE)&lt;br /&gt;(&lt;a href="http://pete.yuret.com/"&gt;http://pete.yuret.com&lt;/a&gt;)&lt;br /&gt;&lt;br /&gt;The purpose of this post is to encourage participation in the task "Parser Evaluation using Textual Entailments" in the 5th International Workshop on Semantic Evaluations, SemEval-2010 (&lt;a href="http://semeval2.fbk.eu/semeval2.php"&gt;http://semeval2.fbk.eu/semeval2.php&lt;/a&gt;) collocated with ACL-2010, July&lt;br /&gt;15-16, Uppsala.&lt;br /&gt;&lt;br /&gt;This shared task should be of interest to researchers working on&lt;br /&gt;  * parsing&lt;br /&gt;  * semantic role labeling&lt;br /&gt;  * recognizing textual entailments&lt;br /&gt;&lt;span class="fullpost"&gt;&lt;br /&gt;&lt;div dir="ltr"&gt;&lt;div&gt;&lt;p&gt;Parser Evaluation using Textual Entailments (PETE) is a &lt;a href="http://semeval2.fbk.eu/semeval2.php?location=tasks&amp;amp;id=17" rel="nofollow"&gt;&lt;font color="#666154"&gt;shared task&lt;/font&gt;&lt;/a&gt; in the &lt;a href="http://semeval2.fbk.eu" rel="nofollow"&gt;&lt;font color="#666154"&gt;SemEval-2010&lt;/font&gt;&lt;/a&gt; Evaluation Exercises on Semantic Evaluation.  The task involves &lt;font color="#666154"&gt;&lt;a href="http://www.nist.gov/tac/2009/RTE" rel="nofollow"&gt;recognizing textual entailments&lt;/a&gt;&lt;/font&gt; (RTE) based on syntactic information.  Given two text fragments called 'Text' and 'Hypothesis', Textual Entailment Recognition is the task of determining whether the meaning of the Hypothesis is entailed (can be inferred) from the Text.  The PETE task focuses on entailments that can be inferred using syntactic information alone.&lt;/p&gt;&lt;br /&gt;&lt;ul&gt;&lt;li&gt;Text: The man with the hat was tired.&lt;/li&gt;&lt;br /&gt;&lt;ul&gt;&lt;li&gt;Hypothesis-1: The man was tired. (YES)&lt;/li&gt;&lt;br /&gt;&lt;br /&gt;&lt;li&gt;Hypothesis-2: The hat was tired. (NO)&lt;/li&gt;&lt;/ul&gt;&lt;/ul&gt;&lt;br /&gt;&lt;/div&gt;&lt;br /&gt;Our goals in introducing this task are:&lt;br /&gt;&lt;br /&gt;&lt;ul&gt;&lt;li&gt;To focus parser evaluation on semantically relevant phenomena.&lt;/li&gt;&lt;br /&gt;&lt;li&gt;To introduce a parser evaluation scheme that is formalism independent.&lt;/li&gt;&lt;br /&gt;&lt;li&gt;To introduce a &lt;i&gt;targeted&lt;/i&gt; textual entailment task focused on a single linguistic competence.&lt;br /&gt;  &lt;/li&gt;&lt;br /&gt;&lt;li&gt;To be able to collect high quality evaluation data from untrained annotators.&lt;/li&gt;&lt;/ul&gt;&lt;br /&gt;The following criteria were used when constructing the entailments:&lt;br /&gt;&lt;br /&gt;&lt;div&gt;&lt;br /&gt;&lt;ul&gt;&lt;li&gt;They should be decidable using only syntactic inference.&lt;/li&gt;&lt;br /&gt;&lt;li&gt;They should be easy to decide by untrained annotators.&lt;/li&gt;&lt;br /&gt;&lt;li&gt;They should be challenging for state of the art parsers.&lt;/li&gt;&lt;/ul&gt;&lt;br /&gt;You can find more details about our entailment generation process in the &lt;a href="http://pete.yuret.com/guide"&gt;PETE Guide&lt;/a&gt;.  You can download the development and test datasets including gold answers and system scores here: &lt;a href="http://pete.yuret.com/docs/PETE_gold.zip?attredirects=0"&gt;PETE_gold.zip&lt;/a&gt;.  There is no training data.  The evaluation is similar to other &lt;a href="http://www.nist.gov/tac/2009/RTE" rel="nofollow"&gt;RTE tasks&lt;/a&gt;.  There is a Google group &lt;a href="http://groups.google.com/group/semeval-pete"&gt;semeval-pete&lt;/a&gt; for task related messages.&lt;/div&gt;&lt;br /&gt;&lt;br /&gt;&lt;div&gt;&lt;br /&gt;&lt;br /&gt;&lt;/div&gt;&lt;br /&gt;&lt;div&gt;&lt;font color="#000000" face="arial, sans-serif"&gt;&lt;span style="border-collapse:collapse"&gt;&lt;b&gt;Instructions:&lt;/b&gt;&lt;/span&gt;&lt;/font&gt;&lt;/div&gt;&lt;div&gt;&lt;ul&gt;&lt;li style="list-style-position:outside;list-style-type:circle"&gt;&lt;span style="color:rgb(0, 0, 0);font-family:arial,sans-serif;border-collapse:collapse"&gt;&lt;b&gt;&lt;div style="display:inline ! important"&gt;&lt;span style="font-weight:normal"&gt;join the mailing list (&lt;a href="http://groups.google.com/group/semeval-pete"&gt;http://groups.google.com/group/semeval-pete&lt;/a&gt;)&lt;/span&gt;&lt;/div&gt;&lt;/b&gt;&lt;/span&gt;&lt;/li&gt;&lt;li style="list-style-position:outside;list-style-type:circle"&gt;&lt;span style="color:rgb(0, 0, 0);font-family:arial,sans-serif;border-collapse:collapse"&gt;&lt;b&gt;&lt;div style="display:inline ! important"&gt;&lt;span style="font-weight:normal"&gt;register in SemEval website (&lt;a href="http://semeval2.fbk.eu" rel="nofollow"&gt;http://semeval2.fbk.eu&lt;/a&gt;)&lt;/span&gt;&lt;/div&gt;&lt;/b&gt;&lt;/span&gt;&lt;/li&gt;&lt;li style="list-style-position:outside;list-style-type:circle"&gt;&lt;span style="color:rgb(0, 0, 0);font-family:arial,sans-serif;border-collapse:collapse"&gt;&lt;b&gt;&lt;div style="display:inline ! important"&gt;&lt;span style="font-weight:normal"&gt;download the development (trial) data from SemEval website (&lt;a href="http://semeval2.fbk.eu" rel="nofollow"&gt;http://semeval2.fbk.eu&lt;/a&gt;)&lt;/span&gt;&lt;/div&gt;&lt;/b&gt;&lt;/span&gt;&lt;/li&gt;&lt;li style="list-style-position:outside;list-style-type:circle"&gt;&lt;span style="color:rgb(0, 0, 0);font-family:arial,sans-serif;border-collapse:collapse"&gt;&lt;b&gt;&lt;div style="display:inline ! important"&gt;&lt;span style="font-weight:normal"&gt;download task guide from task website (&lt;a href="http://pete.yuret.com/guide" rel="nofollow"&gt;http://pete.yuret.com/guide&lt;/a&gt;)&lt;/span&gt;&lt;/div&gt;&lt;/b&gt;&lt;/span&gt;&lt;/li&gt;&lt;li style="list-style-position:outside;list-style-type:circle"&gt;&lt;span style="color:rgb(0, 0, 0);font-family:arial,sans-serif;border-collapse:collapse"&gt;&lt;b&gt;&lt;div style="display:inline ! important"&gt;&lt;span style="font-weight:normal"&gt;&lt;b&gt;&lt;div style="display:inline ! important"&gt;&lt;span style="font-weight:normal"&gt;download test data from SemEval website (&lt;a href="http://semeval2.fbk.eu" rel="nofollow"&gt;http://semeval2.fbk.eu&lt;/a&gt;)&lt;/span&gt;&lt;/div&gt;&lt;/b&gt;&lt;/span&gt;&lt;/div&gt;&lt;/b&gt;&lt;/span&gt;&lt;/li&gt;&lt;li style="list-style-position:outside;list-style-type:circle"&gt;&lt;span style="color:rgb(0, 0, 0);font-family:arial,sans-serif;border-collapse:collapse"&gt;&lt;b&gt;&lt;div style="display:inline ! important"&gt;&lt;span style="font-weight:normal"&gt;upload results to SemEval website (&lt;a href="http://semeval2.fbk.eu" rel="nofollow"&gt;http://semeval2.fbk.eu&lt;/a&gt;)&lt;/span&gt;&lt;/div&gt;&lt;/b&gt;&lt;/span&gt;&lt;/li&gt;&lt;/ul&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;Important Dates:&lt;/b&gt;&lt;/div&gt;&lt;br /&gt;&lt;br /&gt;&lt;div&gt;&lt;br /&gt;&lt;ul&gt;&lt;li&gt;&lt;strike&gt;&lt;span style="color:rgb(0, 0, 0);font-family:arial,sans-serif;border-collapse:collapse"&gt;February 19 - the development (trial) data available.&lt;/span&gt;&lt;/strike&gt;&lt;/li&gt;&lt;br /&gt;&lt;li&gt;&lt;strike&gt;&lt;span style="color:rgb(0, 0, 0);font-family:arial,sans-serif;border-collapse:collapse"&gt;March 26 - the test data available.&lt;/span&gt;&lt;/strike&gt;&lt;/li&gt;&lt;br /&gt;&lt;li&gt;&lt;strike&gt;&lt;span style="color:rgb(0, 0, 0);font-family:arial,sans-serif;border-collapse:collapse"&gt;April 2 - end of submission period for the task.&lt;/span&gt;&lt;/strike&gt;&lt;/li&gt;&lt;br /&gt;&lt;li&gt;&lt;span style="color:rgb(0, 0, 0);font-family:arial,sans-serif;border-collapse:collapse"&gt;April 17 - Submission of description papers.&lt;/span&gt;&lt;/li&gt;&lt;br /&gt;&lt;li&gt;&lt;span style="color:rgb(0, 0, 0);font-family:arial,sans-serif;border-collapse:collapse"&gt;May 6 - Notification of acceptance.&lt;/span&gt;&lt;/li&gt;&lt;br /&gt;&lt;li&gt;&lt;span style="color:rgb(0, 0, 0);font-family:arial,sans-serif;border-collapse:collapse"&gt;July 15-16 - Workshop at ACL 2010, Uppsala.&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;/li&gt;&lt;/ul&gt;&lt;font color="#000000" face="arial, sans-serif"&gt;&lt;span style="border-collapse:collapse"&gt;&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;/span&gt;&lt;/font&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;div&gt;&lt;b&gt;Parsers:&lt;br /&gt;&lt;/b&gt;&lt;p&gt;Here are some links for publicly available parsers that can be used in this task.  You do not have to use any of these parsers, in fact you do not have to use a conventional parsing algorithm at all -- outside the box approaches are highly encouraged.  However, to get a quick baseline system using an existing parser may be a good way to start.&lt;/p&gt;&lt;ul&gt;&lt;li&gt;&lt;a href="http://code.google.com/p/berkeleyparser/"&gt;Berkeley Parser&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://www.cis.upenn.edu/%7Edbikel/software.html#stat-parser" rel="nofollow"&gt;Bikel Parser&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://svn.ask.it.usyd.edu.au/trac/candc/wiki" rel="nofollow"&gt;C&amp;amp;C CCG Parser&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://people.csail.mit.edu/mcollins/code.html" rel="nofollow"&gt;Collins Parser&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://www.cs.brown.edu/people/ec/#software" rel="nofollow"&gt;Charniak Parser&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://www.link.cs.cmu.edu/link/" rel="nofollow"&gt;CMU Link Parser&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://sites.google.com/site/desrparser/"&gt;DeSR Parser&lt;/a&gt;&lt;br /&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://www-tsujii.is.s.u-tokyo.ac.jp/enju/" rel="nofollow"&gt;Enju Parser&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://maltparser.org/" rel="nofollow"&gt;MaltParser&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://webdocs.cs.ualberta.ca/%7Elindek/minipar.htm" rel="nofollow"&gt;Minipar&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://maltparser.org/" rel="nofollow"&gt;MSTParser&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://www.informatics.sussex.ac.uk/research/groups/nlp/rasp/" rel="nofollow"&gt;RASP Parser&lt;/a&gt;&lt;br /&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://nlp.stanford.edu/downloads/lex-parser.shtml" rel="nofollow"&gt;Stanford Parser&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://pete.yuret.com/docs/conll-entailments.pl?attredirects=0"&gt;conll-entailments.pl&lt;/a&gt;: This is not a parser but a simple script to illustrate how short entailments may be generated from a dependency parse.  Incomplete and buggy, use at your own risk.&lt;br /&gt;&lt;/li&gt;&lt;/ul&gt;&lt;b&gt;&lt;br /&gt;Further Reading:&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;&lt;ul&gt;&lt;li&gt;&lt;a href="http://pete.yuret.com/guide"&gt;&lt;i&gt;PETE Guide&lt;/i&gt;&lt;/a&gt;&lt;i&gt;: A description of the entailment generation process (February, 2010).&lt;/i&gt;&lt;/li&gt;&lt;li&gt;&lt;i&gt;&lt;span style="font-style:normal"&gt;&lt;a href="http://aclweb.org/anthology-new/D/D09/D09-1085.pdf" rel="nofollow" style="color:rgb(141, 161, 173);text-decoration:underline"&gt;&lt;i&gt;D09-1085.pdf&lt;/i&gt;&lt;/a&gt;&lt;i&gt;: Rimell, L., S. Clark, and M. Steedman.  Unbounded Dependency Recovery for Parser Evaluation.  Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing (August, 2009).&lt;/i&gt;&lt;/span&gt;&lt;/i&gt;&lt;/li&gt;&lt;li&gt;&lt;i&gt;&lt;span style="font-style:normal"&gt;&lt;a href="http://pete.yuret.com/docs/thesis.pdf?attredirects=0" style="color:rgb(141, 161, 173);text-decoration:underline"&gt;&lt;i&gt;thesis.pdf&lt;/i&gt;&lt;/a&gt;&lt;i&gt;: Onder Eker's MS thesis (August, 2009).&lt;/i&gt;&lt;/span&gt;&lt;/i&gt;&lt;/li&gt;&lt;br /&gt;&lt;li&gt;&lt;a href="http://pete.yuret.com/docs/semeval-abstract.pdf?attredirects=0"&gt;&lt;i&gt;semeval-abstract.pdf&lt;/i&gt;&lt;/a&gt;&lt;i&gt;: The PETE task abstract (December, 2008).&lt;/i&gt;&lt;/li&gt;&lt;li&gt;&lt;i&gt;&lt;span style="font-style:normal"&gt;&lt;a href="http://pete.yuret.com/docs/pete.pdf?attredirects=0" style="color:rgb(141, 161, 173);text-decoration:underline"&gt;&lt;i&gt;pete.pdf&lt;/i&gt;&lt;/a&gt;&lt;i&gt;: The initial PETE task proposal (September, 2008).&lt;/i&gt;&lt;/span&gt;&lt;/i&gt;&lt;/li&gt;&lt;li&gt;&lt;i&gt;&lt;span style="font-style:normal"&gt;&lt;i&gt;&lt;a href="http://lingo.stanford.edu/events/08/pe/" rel="nofollow"&gt;Workshop on Cross-Framework and Cross-Domain Parser Evaluation&lt;/a&gt; (August, 2008)&lt;br /&gt;&lt;/i&gt;&lt;/span&gt;&lt;/i&gt;&lt;/li&gt;&lt;li&gt;&lt;i&gt;&lt;span style="font-style:normal"&gt;&lt;i&gt;&lt;span style="font-style:normal"&gt;&lt;i&gt;&lt;a href="http://nlp.stanford.edu/manning/papers/natlog-wtep07-final.pdf" rel="nofollow" style="color:rgb(141, 161, 173);text-decoration:underline"&gt;natlog-wtep07-final.pdf&lt;/a&gt;: Bill MacCartney and Christopher D. Manning. 2007. Natural logic for textual inference. ACL-PASCAL Workshop on Textual Entailment and Paraphrasing, pp. 193-200. (June, 2007).&lt;/i&gt;&lt;/span&gt;&lt;/i&gt;&lt;/span&gt;&lt;/i&gt;&lt;/li&gt;&lt;li&gt;&lt;i&gt;&lt;span style="font-style:normal"&gt;&lt;i&gt;&lt;span style="font-style:normal"&gt;&lt;i&gt;&lt;span style="font-style:normal"&gt;&lt;i&gt;&lt;a href="http://denizyuret.blogspot.com/2007/06/targeted-textual-entailments-proposal.html" style="color:rgb(141, 161, 173);text-decoration:underline"&gt;targeted textual entailments&lt;/a&gt;: On targeted textual entailments in general (June 2007).&lt;/i&gt;&lt;/span&gt;&lt;/i&gt;&lt;/span&gt;&lt;/i&gt;&lt;/span&gt;&lt;/i&gt;&lt;/li&gt;&lt;br /&gt;&lt;br /&gt;&lt;li&gt;&lt;a href="http://denizyuret.blogspot.com/2006/10/why-you-should-not-use-penn-treebank.html"&gt;&lt;i&gt;a blog post&lt;/i&gt;&lt;/a&gt;&lt;i&gt;: On the consistency of Penn Treebank annotation (October, 2006).&lt;/i&gt;&lt;/li&gt;&lt;li&gt;&lt;i&gt;&lt;a href="http://www.cogs.susx.ac.uk/lab/nlp/carroll/papers/lre98.pdf" rel="nofollow"&gt;lre98.pdf&lt;/a&gt;: Carroll, J., E. Briscoe and A. Sanfilippo (1998) `Parser evaluation: a survey and a new proposal'. In Proceedings of the 1st International Conference on Language Resources and Evaluation, Granada, Spain. 447-454.&lt;/i&gt;&lt;/li&gt;&lt;/ul&gt;&lt;br /&gt;&lt;b&gt;Contact:&lt;/b&gt;&lt;/div&gt;&lt;br /&gt;&lt;div&gt;&lt;br /&gt;&lt;ul&gt;&lt;li&gt;Deniz Yuret  dyuret@ku.edu.tr&lt;/li&gt;&lt;/ul&gt;&lt;br /&gt;&lt;/div&gt;&lt;br /&gt;&lt;/div&gt;&lt;/div&gt;&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-8662638035923847732?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='related' href='http://pete.yuret.com' title='CFP: Parser Evaluation using Textual Entailments (PETE)'/><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/8662638035923847732/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=8662638035923847732' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/8662638035923847732'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/8662638035923847732'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2010/02/cfp-parser-evaluation-using-textual.html' title='CFP: Parser Evaluation using Textual Entailments (PETE)'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-5784000721930286313</id><published>2010-02-21T22:27:00.003+02:00</published><updated>2010-11-03T08:27:27.304+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Publications'/><title type='text'>Preprocessing with Linear Transformations that Maximize the Nearest  Neighbor Classiﬁcation Accuracy</title><content type='html'>Mehmet Ali Yatbaz and Deniz Yuret.  &lt;i&gt;1st CSE Student Workshop (CSW’10)&lt;/i&gt;, 21 February 2010, Koc Istinye Campus, Istanbul.  (&lt;a href="https://docs.google.com/fileview?id=0B6C4-zOYlkxsN2Q2YzZlYTAtM2Q5NC00YmI4LTk4ZmEtYzY2ZjM3NTAxYmY4&amp;hl=en"&gt;PDF&lt;/a&gt;, &lt;a href="http://docs.google.com/present/view?id=d2jm3f3_1282d35pn9hs"&gt;PPT&lt;/a&gt;)&lt;span class="fullpost"&gt;&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Abstract&lt;/b&gt;&lt;br /&gt;We introduce a preprocessing technique for classification problems based on linear transformations. The algorithm incrementally constructs a linear transformation that maximizes the nearest neighbor classification accuracy on the training set. At each iteration the algorithm picks a point in the dataset, and computes a transformation &lt;br /&gt;that moves the point closer to points in its own class and/or away from points in other classes. The composition of the resulting linear transformations lead to statistically significant improvements in instance based learning algorithms.&lt;br /&gt;&lt;br /&gt;&lt;iframe src="http://docs.google.com/present/embed?id=d2jm3f3_1282d35pn9hs" frameborder="0" width="410" height="342"&gt;&lt;/iframe&gt;&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-5784000721930286313?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='related' href='https://docs.google.com/fileview?id=0B6C4-zOYlkxsN2Q2YzZlYTAtM2Q5NC00YmI4LTk4ZmEtYzY2ZjM3NTAxYmY4&amp;hl=en' title='Preprocessing with Linear Transformations that Maximize the Nearest  Neighbor Classiﬁcation Accuracy'/><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/5784000721930286313/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=5784000721930286313' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/5784000721930286313'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/5784000721930286313'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2010/02/preprocessing-with-linear.html' title='Preprocessing with Linear Transformations that Maximize the Nearest  Neighbor Classiﬁcation Accuracy'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-4607096368394312554</id><published>2010-02-21T18:58:00.002+02:00</published><updated>2010-02-23T19:09:01.203+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Publications'/><title type='text'>L1 Regularization for Learning Word Alignments in Sparse Feature Matrices</title><content type='html'>Ergun Bicici and Deniz Yuret.  &lt;i&gt;1st CSE Student Workshop (CSW’10)&lt;/i&gt;, 21 February 2010, Koc Istinye Campus, Istanbul.  (&lt;a href="https://docs.google.com/fileview?id=0B6C4-zOYlkxsMWMyM2ViNTEtZmQ0NS00MTQ4LWJlYzEtNjJlMzhhNWEyYWQ0&amp;hl=en"&gt;PDF&lt;/a&gt;, &lt;a href="https://docs.google.com/fileview?id=0B6C4-zOYlkxsNGQ4MTk1YzItODkyYS00Y2JmLTkzMDgtOWQ0MGMzYWQ2M2Ri&amp;hl=en"&gt;Poster&lt;/a&gt;)&lt;span class="fullpost"&gt;&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Abstract&lt;/b&gt;&lt;br /&gt;Sparse feature representations can be used in various domains. We compare the effectiveness of $L_1$ regularization techniques for regression to learn mappings between features given in a sparse feature matrix. We apply these techniques for learning word alignments commonly used for machine translation. The performance of the learned mappings are measured using the phrase table generated on a larger corpus by a state of the art word aligner. The results show the effectiveness of using $L_1$ regularization versus $L_2$ used in ridge regression.&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-4607096368394312554?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='related' href='https://docs.google.com/fileview?id=0B6C4-zOYlkxsMWMyM2ViNTEtZmQ0NS00MTQ4LWJlYzEtNjJlMzhhNWEyYWQ0&amp;hl=en' title='L1 Regularization for Learning Word Alignments in Sparse Feature Matrices'/><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/4607096368394312554/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=4607096368394312554' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/4607096368394312554'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/4607096368394312554'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2010/02/l1-regularization-for-learning-word.html' title='L1 Regularization for Learning Word Alignments in Sparse Feature Matrices'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-6012833724780545114</id><published>2010-02-19T09:17:00.003+02:00</published><updated>2011-05-10T23:37:57.998+03:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Publications'/><title type='text'>The Noisy Channel Model for Unsupervised Word Sense Disambiguation</title><content type='html'>Deniz Yuret and Mehmet Ali Yatbaz.  &lt;i&gt;Computational Linguistics, &lt;a href="http://www.aclweb.org/anthology-new/J/J10"&gt;Volume 36, Number 1&lt;/a&gt;, March 2010.&lt;/i&gt;  (&lt;a href="http://www.mitpressjournals.org/doi/abs/10.1162/coli.2010.36.1.36103"&gt;Abstract&lt;/a&gt;, &lt;a href="http://www.aclweb.org/anthology-new/J/J10/J10-1004.pdf"&gt;PDF&lt;/a&gt;)&lt;span class="fullpost"&gt;&lt;br /&gt;&lt;br /&gt;&lt;iframe src="https://docs.google.com/present/embed?id=d2jm3f3_18964v3fzcf2" frameborder="0" width="410" height="342"&gt;&lt;/iframe&gt;&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Abstract:&lt;/b&gt; We introduce a generative probabilistic model, the noisy channel model, for unsupervised word sense disambiguation.  In our model, each context C is modeled as a distinct channel through which the speaker intends to transmit a particular meaning S using a possibly ambiguous word W.  To reconstruct the intended meaning the hearer uses the distribution of possible meanings in the given context P(S|C) and possible words that can express each meaning P(W|S).  We assume P(W|S) is independent of the context and estimate it using WordNet sense frequencies.  The main problem of unsupervised WSD is estimating context dependent P(S|C) without access to any sense tagged text.  We show one way to solve this problem using a statistical language model based on large amounts of untagged text.  Our model uses coarse-grained semantic classes for S internally and we explore the effect of using different levels of granularity on WSD performance.  The system outputs fine grained senses for evaluation and its performance on noun disambiguation is better than most previously reported unsupervised systems and close to the best supervised systems. &lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-6012833724780545114?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/6012833724780545114/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=6012833724780545114' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/6012833724780545114'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/6012833724780545114'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2009/09/noisy-channel-model-for-unsupervised.html' title='The Noisy Channel Model for Unsupervised Word Sense Disambiguation'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-116336216789036689</id><published>2010-02-17T22:04:00.002+02:00</published><updated>2011-04-06T14:33:45.147+03:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Downloads'/><title type='text'>Emacs Turkish mode</title><content type='html'>This is for people trying to type Turkish documents on a U.S. keyboard using Emacs.  The program provides a turkish-mode in which the correct Turkish accents are added to the ascii version of the last word typed each time the user hits space.  The latest version is available &lt;a href="https://docs.google.com/uc?id=0B6C4-zOYlkxsYzliNjVjNjEtMDBhMC00MGU0LTg2ODQtYTc1NjhjOGYyODcw&amp;export=download&amp;hl=en"&gt; here&lt;/a&gt;.&lt;span class="fullpost"&gt;&lt;br /&gt;&lt;br /&gt;It was inspired by Gökhan Tür's deasciifier:&lt;br /&gt;&lt;a href="http://www.hlst.sabanciuniv.edu/TL/deascii.html"&gt;http://www.hlst.sabanciuniv.edu/TL/deascii.html&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;The program uses decision lists (included at the end of this file) which was created based on 1 million words of Turkish news text using the GPA algorithm. For more information on GPA see the &lt;a href="http://denizyuret.blogspot.com/2006/11/greedy-prepend-algorithm-for-decision.html"&gt;Greedy prepend algorithm for decision list induction&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;To activate the program first load this file into emacs:&lt;br /&gt;M-x load-file ENTER turkish.el ENTER&lt;br /&gt;Then turn on the turkish mode:&lt;br /&gt;M-x turkish-mode&lt;br /&gt;&lt;br /&gt;When Turkish mode is enabled, the space, tab, and enter keys correct the previous word by adding Turkish accents. For corrections use C-t to toggle the accent of the character under the cursor.&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-116336216789036689?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='related' href='http://deniz.yuret.com/turkish/turkish.el' title='Emacs Turkish mode'/><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/116336216789036689/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=116336216789036689' title='28 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/116336216789036689'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/116336216789036689'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2006/11/emacs-turkish-mode.html' title='Emacs Turkish mode'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>28</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-6538535957097277009</id><published>2010-02-14T16:26:00.014+02:00</published><updated>2010-11-03T09:08:54.394+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Türkçe'/><title type='text'>Permutasyon, Kombinasyon ve 12 kardeşleri</title><content type='html'>Bilgisayar Olimpiyat Kampından Notlar I&lt;br /&gt;&lt;br /&gt;Temel sayma problemlerinin çoğu iki sonlu kümenin elemanları arasında kaç çeşit eşleme yapılabileceği sorularına indirgenebilir.  Örneğin bir basketbol takımının K oyuncusunu bir otelin N odasına yerleştirmeye çalıştırdığımızı düşünelim.  Öncelikle bir odada kaç kişi kalabileceği sorusuna göre üç farklı tip problem tanımlayabiliriz: &lt;span class="fullpost"&gt;&lt;br /&gt;&lt;br /&gt;A. Bir odada herhangi sayıda oyuncu kalabilir.&lt;br /&gt;B. Bir odada en çok bir kişi kalmalı (K &lt;= N).&lt;br /&gt;C. Bir odada en az bir kişi kalmalı (K &gt;= N).&lt;br /&gt;&lt;br /&gt;Diğer yandan K ya da N küme elemanlarının kimliklerinin önemli olup olmadığı, ya da permutasyonlarının (değişik sıralamalarının) farklı çözüm olarak sayılıp sayılmadığı da bize dört farklı tip problem verir.  Bu problem tiplerini değişik objelerle örneklendirecek olursak:&lt;br /&gt;&lt;br /&gt;0. K ve N elemanlarının kimlikleri önemli.  (K kişiyi N odaya dağıtıyoruz)&lt;br /&gt;1. K'lar birbiriyle eş, N'ler farklı.  (K topu N odaya dağıtıyoruz)&lt;br /&gt;2. K'lar farklı, N'ler birbiriyle eş.  (K kişiyi N takıma ayırıyoruz)&lt;br /&gt;3. K'lar da N'ler de birbiriyle eş.  (K topu N gruba ayırıyoruz)&lt;br /&gt;&lt;br /&gt;Burada birbiriyle eşlenmesi daha kolay olduğu için kişi yerine top, oda yerine takım ya da grup kullandık.  (0) probleminde kimin hangi odada kaldığı önemliyken, (1) probleminde sadece hangi odada kaç top olduğu önemli.  (2) probleminde ABCD isminde dört kişiyi (AB) ve (CD) gibi iki takıma ayırmakla (CD) ve (AB) gibi iki takıma ayırmak aynı çözüm sayılıyor.  (3) probleminde ise ne topların ne grupların kimliği önemli, burada örneğin 6 topu 3+2+1 olarak ayırmak ile 4+1+1 olarak ayırmak farklı çözümler sayılsa da 3+2+1 ile 1+2+3 aynı çözüm sayılıyor.&lt;br /&gt;&lt;br /&gt;Matematikçi Gian-Carlo Rota bu seçenekleri göz önüne alarak temel sayma problemlerini A0, A1, ..., C2, C3 şeklinde &lt;a href="http://en.wikipedia.org/wiki/Twelvefold_way"&gt;12 kategoride&lt;/a&gt; sınıflandırmayı öneriyor.  Örneğin lisede öğrendiğimiz permutasyon B0, kombinasyon B1 problemlerine denk geliyor.  Malesef bu sınıflamanın dışında kalan Catalan objelerini, permutasyon tiplerini, ağaç tiplerini sayma problemleri de var fakat onları başka bir notta ele alırız.&lt;br /&gt;&lt;br /&gt;Bu problemlerin  bazıları şu teknikle çözülebilir: saymamız istenilen çözümlerden herhangi birini oluşturmak için K aşama tanımlayalım.   i'nci aşamada $n_i$ potansiyel farklı seçenek olsun.  İzleyebileceğimiz farklı yol sayısı bu durumda $n_1 n_2 \ldots n_k$ çarpımı olarak ifade edilebilir.  Hemen bir iki örnek:&lt;br /&gt;&lt;br /&gt;A0. K kişiyi N odaya herhangi bir sınırlama olmadan dağıtmak için her kişiye tek tek hangi odayı istediklerini sorarız.  K sorunun (aşamanın) her birine N farklı cevap (seçenek) gelme olasılığı var.  Dolayısıyla potansiyel çözüm sayımız $N^K$ olur.&lt;br /&gt;&lt;br /&gt;B0. K kişiyi N odaya her odada en fazla bir kişi kalacak şekilde dağıtmak için yine herkese tek tek hangi odayı istediklerini sorabiliriz.  Fakat sırası gelen kişi kendinden önce seçilmiş odaları seçemez, boş odalardan birini seçebilir.  Yani birinci kişi N oda, ikinci kişi (N-1) oda, üçüncü kişi (N-2) odadan birini seçebilir.  Bu durumda çözüm sayımız N(N-1)...(N-K+1)=$N^\underline{K}$ olur.&lt;br /&gt;&lt;br /&gt;Bazan yukarıda verdiğimiz sayma tekniği kendi başına yeterli olmaz, çözümleri bilerek fazla ya da eksik sayıp sonradan bir düzeltme yapmak gerekebilir.&lt;br /&gt;&lt;br /&gt;B1. K topu N odaya her odada en fazla bir top olacak şekilde dağıtma problemine bakalım.  Topların kimliği önemli olmadığına göre buna N odadan K tanesini seçme problemi olarak da bakabiliriz.  Çözüme B0 gibi başlayıp her topa hangi odayı istediğini soralım.  Örneğin 3 top ve 5 oda varsa 123, 231, 245 gibi $N^\underline{K}$ farklı cevap alabiliriz.  Fakat topların kimlikleri önemli olmadığına göre 123, 132, 213, 231, 312, 321 cevaplarının hep aynı 3 odayı doldurdukları için aynı konfigürasyon olarak sayılmaları gerekir.  Diğer bir deyişle aslında B0 çözümü kullanarak biz her konfigürasyonu 6=3!=K! defa saymış olduk.  Doğru cevabı elde etmek için B0'daki cevabı K!'e bölerek $N^{\underline{K}} / K! = \tbinom{N}{K}$ formülünü elde ederiz.&lt;br /&gt;&lt;br /&gt;Bazı problemlerde en pratik çözüm ise önce eldeki problemi saymasını bildiğimiz başka bir probleme dönüştürmekten geçer:&lt;br /&gt;&lt;br /&gt;A1. K topu N odaya herhangi bir sınırlama olmaksızın dağıtmayı düşünelim.  Bu topları N odaya koymakla onları sıraya dizip aralarına N-1 duvar yerleştirmek birbirine eş iki problem olarak görülebilir.  İkinci problemde duvarlarla toplar K+N-1 pozisyon işgal ederler.  Bu pozisyonlardan hangi K tanesini topların işgal edeceğini sorarsak $\binom{K+N-1}{K}$ çözümünü elde ederiz.&lt;br /&gt;&lt;br /&gt;C1. K topu N odaya her odada en az bir top olacak şekilde dağıtma (K&gt;=N) probleminde ise önce her odaya birer top koyarak boş oda kalmamasını garantiler, daha sonra kalan K-N top üzerinde A1 çözümünü uygulayabiliriz: $\binom{K-1}{K-N}$.&lt;br /&gt;&lt;br /&gt;Son olarak bu problemlerin bir kısmının kapalı bir formülle çözümü yoktur.  Bu durumlarda özyinelemeli (recursive) formüller aramak tek çıkar yol olabilir:&lt;br /&gt;&lt;br /&gt;C2. K kişiyi N takıma (her takımda en az bir kişi olacak şekilde, K&gt;=N) ayırma problemini ele alalım.  Özyinelemeli çözümlerin ana fikri problemin daha küçük bir halini çözebildiğimizi varsaymak, ve buna dayanarak orijinal problemi çözmektir.  C2 için önce K-1 kişiyi takımlara ayırmanın yollarını sayabildiğimizi varsayalım.  K'inci kişiyi ne yapacağımızı düşünelim.  Bu son kişi ya kendi başına bir takım olacak (bu durumda geri kalan K-1 kişi N-1 takım oluşturacak), ya da diğerlerinin oluşturduğu takımlardan birinin içine eklenecek (bu durumda geri kalan K-1 kişinin N takım oluşturması, ve son kişinin bunlardan birini seçmesi gerek).  Bu dediklerimizi formüle dökersek: f(K, N) = f(K-1, N-1) + N f(K-1, N).  Formülü tamamlamak için problemin en küçük halini elle çözebiliriz f(1,1) = 1.  Bu çözüm bize genel olarak K elemanlı bir kümenin elemanlarının kaç değişik şekilde N parçaya ayrılabileceğini verir ve hesapladığımız f fonksiyonu da &lt;a href="http://en.wikipedia.org/wiki/Stirling_number_of_the_second_kind"&gt;ikinci tip Stirling sayısı&lt;/a&gt; olarak bilinir.&lt;br /&gt;&lt;br /&gt;C3. K topu N gruba ayırma problemi için ise en küçük grupta tek top olan çözümlerle birden fazla top olan çözümleri ayrı ayrı sayalım.   En küçük grupta tek top olan çözümlerin sayısını geri kalan K-1 topu N-1 gruba ayırarak bulabiliriz.  En küçük grupta birden fazla top olan çözümleri ise (daha önce C1'de yaptığımız gibi) önce her gruba garanti olan birer topu koyup geri kalan K-N topu N gruba ayırarak  sayabiliriz.  Formüle dökecek olursak: f(K, N) = f(K-1, N-1) + f(K-N, N) ve f(1, 1) = 1.  Bu çözüm bize genel olarak bir K tamsayısının kaç değişik şekilde N parçaya ayrılabileceğini verir. &lt;br /&gt;&lt;br /&gt;Diğer çözdüğümüz problemlerde de benzer tekniklerle özyinelemeli formüller bulabiliriz.  Örneğin:&lt;br /&gt;&lt;br /&gt;A0. K-1 kişi hangi odalarda kalacaklarına karar verdikten sonra son kişi yine N odadan birini seçer.&lt;br /&gt;$N^K$: f(N,K) = N f(N,K-1)&lt;br /&gt;&lt;br /&gt;B0. Son odada kimsenin kalmadığı ve birinin kaldığı çözümleri toplarız.&lt;br /&gt;$N^{\underline{K}}$:  f(N,K) = f(N-1,K) + K f(N-1,K-1)&lt;br /&gt;&lt;br /&gt;B1. Son odaya top konulan ve konulmayan çözümleri toplarız.&lt;br /&gt;$\tbinom{N}{K}$:  f(N,K) = f(N-1,K-1) + f(N-1,K) &lt;br /&gt;&lt;br /&gt;Kalan problemlerin çözümünü okuyucuya bırakıyorum.  İpucu olarak C0 ve A2'nin C2 kullanarak, A3'un ise C3 kullanarak çözülebileceğini B2 ile B3'un ise pek ilginç çözümleri olmadığını söyleyebilirim.&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-6538535957097277009?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='related' href='http://en.wikipedia.org/wiki/Twelvefold_way' title='Permutasyon, Kombinasyon ve 12 kardeşleri'/><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/6538535957097277009/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=6538535957097277009' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/6538535957097277009'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/6538535957097277009'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2010/02/permutasyon-kombinasyon-ve-12.html' title='Permutasyon, Kombinasyon ve 12 kardeşleri'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-5552888288056565952</id><published>2009-12-11T16:26:00.001+02:00</published><updated>2011-01-18T11:58:09.378+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Publications'/><title type='text'>Unsupervised morphological disambiguation using statistical language models</title><content type='html'>Mehmet Ali Yatbaz and Deniz Yuret.  &lt;i&gt;NIPS 2009 Workshop on Grammar Induction, Representation of Language and Language Learning.&lt;/i&gt;  December 2009.  (&lt;a href="https://docs.google.com/uc?id=0B6C4-zOYlkxsZGQ1MmRhMjQtNTFmNS00MjM0LWFkMDQtYTVkNDU4ZjMxNDg5&amp;export=download&amp;hl=en"&gt;PDF&lt;/a&gt;, &lt;a href="https://docs.google.com/uc?id=0B6C4-zOYlkxsOGMzMWIyODItNTljZC00OWIxLThkNTEtMTk3ZTJjM2E4MzQy&amp;export=download&amp;hl=en"&gt;Poster&lt;/a&gt;)&lt;span class="fullpost"&gt;&lt;br /&gt;&lt;b&gt;Abstract:&lt;/b&gt;&lt;br /&gt;In this paper, we present a probabilistic model for the unsupervised morphological disambiguation problem. Our model assigns morphological parses T to the contexts C instead of assigning them to the words W. The target word $w \in W$ determines the possible parse set $T_w \subset T$ that can be used in $w$'s context $c_w \in C$. To assign the correct morphological parse $t\in T_w$ to $w$, our model finds the parse $t\in T_w$ that maximizes $P(t|c_w)$. $P(t|c_w)$'s are estimated using a statistical language model and the vocabulary of the corpus. The system performs significantly better than an unsupervised baseline and its performance is close to a supervised baseline.&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-5552888288056565952?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/5552888288056565952/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=5552888288056565952' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/5552888288056565952'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/5552888288056565952'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2009/12/unsupervised-morphological.html' title='Unsupervised morphological disambiguation using statistical language models'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-5988565105413804051</id><published>2009-11-17T17:00:00.003+02:00</published><updated>2010-11-03T08:27:27.305+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Links'/><title type='text'>How to speak</title><content type='html'>******************&lt;br /&gt;KOC UNIVERSITY&lt;br /&gt;ECOE 590 SEMINAR&lt;br /&gt;******************&lt;br /&gt;&lt;br /&gt;Speaker  : Patrick Winston (video presentation)&lt;br /&gt;Title    : How to speak&lt;br /&gt;Date     : 17 November 2009, Tuesday&lt;br /&gt;Time     : 17:00&lt;br /&gt;Place    : ENG B30&lt;br /&gt;Refreshments will be served at 16:45&lt;br /&gt;&lt;br /&gt;Abstract: In this skillful lecture, Professor Patrick Winston of the Massachusetts Institute of Technology offers tips on how to give an effective talk, cleverly illustrating his suggestions by using them himself. He emphasizes how to start a lecture, cycling in on the material, using verbal punctuation to indicate transitions, describing "near misses" that strengthen the intended concept, and asking questions. He also talks about using the blackboard, overhead projections, props, and "how to stop."&lt;br /&gt;&lt;br /&gt;Video available at &lt;a href="http://isites.harvard.edu/fs/html/icb.topic58703/winston1.html"&gt;http://isites.harvard.edu/fs/html/icb.topic58703/winston1.html&lt;/a&gt;&lt;br /&gt;&lt;span class="fullpost"&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-5988565105413804051?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='related' href='http://isites.harvard.edu/fs/html/icb.topic58703/winston1.html' title='How to speak'/><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/5988565105413804051/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=5988565105413804051' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/5988565105413804051'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/5988565105413804051'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2009/11/how-to-speak.html' title='How to speak'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-3579132866597436376</id><published>2009-08-27T12:22:00.000+03:00</published><updated>2010-11-03T09:08:54.395+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Downloads'/><title type='text'>Turkish Language Resources</title><content type='html'>This post contains links to various Turkish language resources that I have collected.  Please send a comment if you find Turkish resources that you would like to see on this page.&lt;span class="fullpost"&gt;&lt;br /&gt;&lt;br /&gt;&lt;dt&gt;&lt;a href="http://www.ozguryilmazel.com/turkish.html"&gt;Bibliography&lt;/a&gt;&lt;br /&gt;&lt;/dt&gt;&lt;dd&gt; Özgür Yılmazel's Bibliography on Turkish Information Retrieval and Natural Language Processing.&lt;br /&gt;&lt;/dd&gt;&lt;dt&gt;&lt;a href="http://deniz.yuret.com/turkish/tr-disamb.tgz"&gt; tr-disamb.tgz&lt;/a&gt;&lt;br /&gt;&lt;/dt&gt;&lt;dd&gt; Turkish morphological disambiguator code.  Slow but 96% accurate.  See &lt;a href="http://denizyuret.blogspot.com/2006/06/learning-morphological-disambiguation.html"&gt; Learning morphological disambiguation rules for Turkish&lt;/a&gt; for the theory.&lt;br /&gt;&lt;/dd&gt;&lt;dt&gt;&lt;a href="http://deniz.yuret.com/turkish/correctparses_03.txt.gz"&gt; correctparses_03.txt.gz&lt;/a&gt;, &lt;a href="http://deniz.yuret.com/turkish/train.merge.gz"&gt; train.merge.gz&lt;/a&gt;&lt;br /&gt;&lt;/dt&gt;&lt;dd&gt;Turkish morphology training files.  Semi-automatically tagged, has limited accuracy.  Two files have the same data except the second file also includes the ambiguous parses (the first parse on each line is correct).&lt;br /&gt;&lt;/dd&gt;&lt;dt&gt;&lt;a href="http://deniz.yuret.com/turkish/test.1.2.dis.gz"&gt; test.1.2.dis.gz&lt;/a&gt;, &lt;a href="http://deniz.yuret.com/turkish/test.merge.gz"&gt; test.merge.gz&lt;/a&gt;&lt;br /&gt;&lt;/dt&gt;&lt;dd&gt;Turkish morphology test files, second one includes ambiguous parses (the first parse on each line is correct).  The data is hand tagged, it has good accuracy.&lt;br /&gt;&lt;/dd&gt;&lt;dt&gt;&lt;a href="http://deniz.yuret.com/turkish/tr-tagger.tgz"&gt;tr-tagger.tgz&lt;/a&gt;&lt;br /&gt;&lt;/dt&gt;&lt;dd&gt; Turkish morphological tagger, includes Oflazer's finite state machines for Turkish.  From Kemal Oflazer. Please use with permission.  Requires the publically available &lt;a href="http://www.fsmbook.com/"&gt;Xerox Finite State software&lt;/a&gt;.&lt;br /&gt;&lt;/dd&gt;&lt;dt&gt;&lt;a href="http://deniz.yuret.com/turkish/turklex.tgz"&gt; turklex.tgz&lt;/a&gt;, &lt;a href="http://deniz.yuret.com/turkish/pc_kimmo.tgz"&gt; pc_kimmo.tgz&lt;/a&gt;&lt;br /&gt;&lt;/dt&gt;&lt;dd&gt;Turkish morphology rules for PC-Kimmo by Kemal Oflazer.  Older implementation. Originally from &lt;a href="http://www.cs.cmu.edu/afs/cs/project/ai-repository/ai/areas/nlp/morph/pc_kimmo/turklex"&gt; www.cs.cmu.edu&lt;/a&gt;&lt;br /&gt;&lt;/dd&gt;&lt;dt&gt;&lt;a href="http://deniz.yuret.com/turkish/Milliyet1.bz2"&gt; Milliyet1.bz2&lt;/a&gt;, &lt;a href="http://deniz.yuret.com/turkish/Milliyet2.bz2"&gt; Milliyet2.bz2&lt;/a&gt;, &lt;a href="http://deniz.yuret.com/turkish/Milliyet3.bz2"&gt; Milliyet3.bz2&lt;/a&gt;&lt;br /&gt;&lt;/dt&gt;&lt;dd&gt; Original Milliyet corpus, one token per line, 19,627,500 total tokens.  Latin-5 encoded, in three 11MB parts.  From Kemal Oflazer.  Please use with permission.&lt;br /&gt;&lt;/dd&gt;&lt;dt&gt;&lt;a href="http://people.sabanciuniv.edu/~oflazer/balkanet/index.htm"&gt; Turkish wordnet&lt;/a&gt;&lt;br /&gt;&lt;/dt&gt;&lt;dd&gt; From Kemal Oflazer.  Please use with permission. &lt;br /&gt;&lt;/dd&gt;&lt;dt&gt;&lt;a href="http://www.ii.metu.edu.tr/~corpus"&gt;METU-Sabanci Turkish Treebank&lt;/a&gt;&lt;br /&gt;&lt;/dt&gt;&lt;dd&gt; Turkish treebank with dependency annotations.  Please use with permission. &lt;br /&gt;&lt;/dd&gt;&lt;dt&gt;&lt;a href="http://deniz.yuret.com/turkish/sozluk.txt.gz"&gt; sozluk.txt.gz&lt;/a&gt;&lt;br /&gt;&lt;/dt&gt;&lt;dd&gt; English-Turkish dictionary (127157 entries, 826K) Originally from &lt;a href="http://www.fen.bilkent.edu.tr/~aykutlu/sozluk.txt"&gt; www.fen.bilkent.edu.tr/~aykutlu&lt;/a&gt;.&lt;br /&gt;&lt;/dd&gt;&lt;dt&gt;&lt;a href="http://deniz.yuret.com/turkish/sozluk-boun.txt.gz"&gt; sozluk-boun.txt.gz&lt;/a&gt;&lt;dt&gt;&lt;dd&gt; Turkish word list (25822 words, 73K) Originally from &lt;a href="http://www.cmpe.boun.edu.tr/courses/cmpe230/common/TURKISH/sozluk.txt"&gt; www.cmpe.boun.edu.tr/courses/cmpe230&lt;/a&gt;&lt;br /&gt;&lt;/dd&gt;&lt;dt&gt;&lt;a href="http://deniz.yuret.com/turkish/SOZLUK_BASKI.pdf"&gt; Avrupa Birliği Temel Terimler Sözlüğü&lt;/a&gt;&lt;br /&gt;&lt;/dt&gt;&lt;dd&gt;(Originally from: &lt;a href="http://www.abgs.gov.tr/ab_dosyalar/SOZLUK%20BASKI.pdf"&gt; www.abgs.gov.tr/ab_dosyalar&lt;/a&gt;, Oct 6, 2006) &lt;br /&gt;&lt;/dd&gt;&lt;dt&gt;&lt;a href="http://deniz.yuret.com/turkish/BilisimSozlugu.zip"&gt; BilisimSozlugu.zip&lt;/a&gt;&lt;br /&gt;&lt;/dt&gt;&lt;dd&gt;Bilişim Sözlüğü by &lt;a href="http://www.busim.ee.boun.edu.tr/~sankur"&gt;Bülent Sankur&lt;/a&gt; (Originally from: &lt;a href="http://www.bilisimsozlugu.com"&gt;www.bilisimsozlugu.com&lt;/a&gt;, Oct 9, 2006) &lt;br /&gt;&lt;/dd&gt;&lt;dt&gt;&lt;a href="http://deniz.yuret.com/turkish/turkish.el"&gt; turkish.el&lt;/a&gt;&lt;br /&gt;&lt;/dt&gt;&lt;dd&gt;Emacs extension that automatically adds accents to Turkish words while typing on an English keyboard.&lt;br /&gt;&lt;/dd&gt;&lt;dt&gt;&lt;a href="http://deniz.yuret.com/turkish/en-tr.zip"&gt; en-tr.zip&lt;/a&gt;, &lt;a href="http://deniz.yuret.com/turkish/lm.tr.gz"&gt; lm.tr.gz&lt;/a&gt;&lt;br /&gt;&lt;/dt&gt;&lt;dd&gt;Turkish English parallel text from Kemal Oflazer, Statistical Machine Translation into a Morphologically Complex Language, Invited Paper, In Proceedings of &lt;a href="http://www.gelbukh.com/cicling/2008/"&gt;CICLING 2008&lt;/a&gt; -- Conference on Intelligent Text Processing and Computational Linguistics, Haifa, Israel, February 2008 (lowercased and converted to utf8).  The Turkish part of the dataset is "selectively split", i.e. some suffixes are separated from their stems, some are not.  lm.tr.gz is the Turkish text used to develop the language model.&lt;br /&gt;&lt;/dd&gt;&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-3579132866597436376?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/3579132866597436376/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=3579132866597436376' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/3579132866597436376'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/3579132866597436376'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2006/11/turkish-resources.html' title='Turkish Language Resources'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-1431456418800221383</id><published>2009-08-14T08:59:00.003+03:00</published><updated>2010-11-03T08:14:40.092+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Students'/><title type='text'>Önder Eker, M.S. 2009</title><content type='html'>&lt;b&gt;Parser Evaluation Using Textual Entailments&lt;/b&gt;&lt;br /&gt;Önder Eker. M.S. Thesis. Boğaziçi Üniversitesi Department of Computer Engineering, August 2009. (&lt;a href="http://deniz.yuret.com/2009/08/onder-eker-ms-2009/thesis.pdf"&gt;PDF&lt;/a&gt;)&lt;br /&gt;&lt;span class="fullpost"&gt;&lt;br /&gt;&lt;b&gt;Abstract&lt;/b&gt;&lt;br /&gt;Syntactic parsing is a basic problem in natural language processing. It can be deﬁned as assigning a structure to a sentence. Two prevalent approaches to parsing are phrase-structure parsing and dependency parsing. A related problem is parser evaluation. PETE is a dependency-based evaluation where the parse is represented as a list of simple sentences, similar to the Recognizing Textual Entailments task. Each entailment focuses on one relation. A priori training of annotators is not required.  A program generates entailments from a dependency parse. Phrase-structure parses are converted to dependency parses to generate entailments. Additional entailments are generated for phrase-structure coordinations. Experiments are carried out with a function-tagger. Parsers are evaluated on the set of entailments generated from the Penn Treebank WSJ and Brown test sections. A phrase-structure parser obtained the highest score.&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-1431456418800221383?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/1431456418800221383/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=1431456418800221383' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/1431456418800221383'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/1431456418800221383'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2009/08/onder-eker-ms-2009.html' title='Önder Eker, M.S. 2009'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-1214261912916945968</id><published>2009-08-07T13:56:00.002+03:00</published><updated>2010-11-03T09:08:54.396+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Notes'/><title type='text'>ACL 2009 Notes</title><content type='html'>Some notable ACL talks with links and notes...&lt;span class="fullpost"&gt;&lt;br /&gt;&lt;ul&gt;&lt;li&gt;&lt;p&gt;&lt;a href="http://www.acl-ijcnlp-2009.org/main/tutorials.html#T2"&gt;Tutorial:&lt;/a&gt; &lt;b&gt;Kevin Knight, Philipp Koehn&lt;/b&gt;.&lt;br /&gt;&lt;i&gt;Topics in Statistical Machine Translation&lt;/i&gt;&lt;br /&gt;MT:  Phrase based, hierarchical, and syntax based approaches.  Hiero is equivalent? to a syntax based approach with a single nonterminal.  Minimum Bayes Risk (MBR) chooses not the best option but the one that has maximum expected BLEU.  Approaches that work with lattices and forests.  System combination provides significant gain.  Integrating LM into decoding improves.  Cube pruning makes hiero and syntax based more efficient.  Throwing out 99% of the phrase table gives no loss.  Factored models help when factors used as back off.  Reordering tried before.  The source and target can be string, tree, or forest.  Arabic and Chinese seem most popular.  Good test: can you put a jumbled up sentence in the right order.  If we could output only grammatical sentences (simplified English?).  Dependency LM for output language.  Lattice translation.  Giza alignments not very accurate, guess = 80%.  Bleu between human translators is at the level of best systems, i.e. cannot use as upper bound.&lt;br /&gt;&lt;/p&gt;&lt;/li&gt;&lt;li&gt;&lt;p&gt;&lt;a href="http://www.acl-ijcnlp-2009.org/main/tutorials.html#T6"&gt;Tutorial:&lt;/a&gt; &lt;b&gt;Simone Paolo Ponzetto and Massimo Poesio.&lt;/b&gt;&lt;br /&gt;&lt;i&gt;State-of-the-art NLP Approaches to Coreference Resolution: Theory and Practical Recipes&lt;/i&gt;&lt;br /&gt;Coref: ACE is current standard dataset.  Also MUC and other new ones.  Anaphora approx 50% proper nouns, 40% noun phrases, 10% pronouns.  NPs most difficult.  Tough to know when discourse-new.  Have evaluation problems like other fields.  Would think deciding on anaphora easier for annotators, but issues like whether to consider China and its population anaphoric.&lt;br /&gt;&lt;/p&gt;&lt;/li&gt;&lt;li&gt;&lt;p&gt;&lt;a href="http://aclweb.org/anthology-new/P/P09/P09-1001.pdf"&gt;P09-1001&lt;/a&gt; [&lt;a href="http://aclweb.org/anthology-new/P/P09/P09-1001.bib"&gt;bib&lt;/a&gt;]: &lt;b&gt;Qiang Yang; Yuqiang Chen; Gui-Rong Xue; Wenyuan Dai; Yong Yu&lt;/b&gt;.&lt;br /&gt;&lt;i&gt;Heterogeneous Transfer Learning for Image Clustering via the SocialWeb&lt;/i&gt;&lt;br /&gt;ML: Qiang Yang gave the first invited talk.  When the training and test sets have different distributions or different representations.  Did not talk much about: when train and test have different labels.  Link to causality.  Unsupervised pre-learning boosting supervised learning curve.&lt;br /&gt;&lt;/p&gt;&lt;/li&gt;&lt;li&gt;&lt;p&gt;&lt;a href="http://aclweb.org/anthology-new/P/P09/P09-1002.pdf"&gt;P09-1002&lt;/a&gt; [&lt;a href="http://aclweb.org/anthology-new/P/P09/P09-1002.bib"&gt;bib&lt;/a&gt;]: &lt;b&gt;Katrin Erk; Diana McCarthy; Nicholas Gaylord.&lt;/b&gt;&lt;br /&gt;&lt;i&gt;Investigations on Word Senses and Word Usages&lt;/i&gt;&lt;br /&gt;WSD: Annotators provide scores 1-5 for two tasks: how good a fit between a usage and sense, how close are two usages of same word.  Claim forcing annotators to single decision detrimental.  Also claim coarse senses insufficient to explain results.&lt;br /&gt;&lt;/p&gt;&lt;/li&gt;&lt;li&gt;&lt;p&gt;&lt;a href="http://aclweb.org/anthology-new/P/P09/P09-1010.pdf"&gt;P09-1010&lt;/a&gt; [&lt;a href="http://aclweb.org/anthology-new/P/P09/P09-1010.bib"&gt;bib&lt;/a&gt;]: &lt;b&gt;S.R.K. Branavan; Harr Chen; Luke Zettlemoyer; Regina Barzilay&lt;/b&gt;.&lt;br /&gt;&lt;i&gt;Reinforcement Learning for Mapping Instructions to Actions&lt;/i&gt;&lt;br /&gt;Situated language: Best paper award.  Good work goes beyond studying language in isolation.  Reinforcement results sound incredibly good, number of features pretty small, how much prior info did they exactly use?&lt;br /&gt;&lt;/p&gt;&lt;/li&gt;&lt;li&gt;&lt;p&gt;&lt;a href="http://aclweb.org/anthology-new/P/P09/P09-1011.pdf"&gt;P09-1011&lt;/a&gt; [&lt;a href="http://aclweb.org/anthology-new/P/P09/P09-1011.bib"&gt;bib&lt;/a&gt;]: &lt;b&gt;Percy Liang; Michael Jordan; Dan Klein&lt;/b&gt;&lt;br /&gt;&lt;i&gt;Learning Semantic Correspondences with Less Supervision&lt;/i&gt;&lt;br /&gt;Semantic representations: Learn semantic mappings in the domains of weather, robocup sportscasting, and NFL recaps when it is not clear what record and what field the text is referring to.&lt;br /&gt;&lt;/p&gt;&lt;/li&gt;&lt;li&gt;&lt;p&gt;&lt;a href="http://aclweb.org/anthology-new/P/P09/P09-1009.pdf"&gt;P09-1009&lt;/a&gt; [&lt;a href="http://aclweb.org/anthology-new/P/P09/P09-1009.bib"&gt;bib&lt;/a&gt;]: &lt;b&gt;Benjamin Snyder; Tahira Naseem; Regina Barzilay&lt;/b&gt;&lt;br /&gt;&lt;i&gt;Unsupervised Multilingual Grammar Induction&lt;/i&gt;&lt;br /&gt;Syntax: A candidate constituent in one language may be split in another preventing wrong rules to be learnt.&lt;br /&gt;&lt;/p&gt;&lt;/li&gt;&lt;li&gt;&lt;p&gt;&lt;a href="http://aclweb.org/anthology-new/P/P09/P09-1024.pdf"&gt;P09-1024&lt;/a&gt; [&lt;a href="http://aclweb.org/anthology-new/P/P09/P09-1024.bib"&gt;bib&lt;/a&gt;]: &lt;b&gt;Christina Sauper; Regina Barzilay&lt;/b&gt;&lt;br /&gt;&lt;i&gt;Automatically Generating Wikipedia Articles: A Structure-Aware Approach&lt;/i&gt;&lt;br /&gt;Summarization: I did not know summarization consists of cutting and pasting existing text.&lt;br /&gt;&lt;/p&gt;&lt;/li&gt;&lt;li&gt;&lt;p&gt;&lt;a href="http://aclweb.org/anthology-new/P/P09/P09-1025.pdf"&gt;P09-1025&lt;/a&gt; [&lt;a href="http://aclweb.org/anthology-new/P/P09/P09-1025.bib"&gt;bib&lt;/a&gt;]: &lt;b&gt;Neil McIntyre; Mirella Lapata&lt;/b&gt;&lt;br /&gt;&lt;i&gt;Learning to Tell Tales: A Data-driven Approach to Story Generation&lt;/i&gt;&lt;br /&gt;Schemas: Learning a model of fairy tales to generate new ones.  Nice idea but resulting stories not so good.  Better models possible.&lt;br /&gt;&lt;/p&gt;&lt;/li&gt;&lt;li&gt;&lt;p&gt;&lt;a href="http://aclweb.org/anthology-new/P/P09/P09-1034.pdf"&gt;P09-1034&lt;/a&gt; [&lt;a href="http://aclweb.org/anthology-new/P/P09/P09-1034.bib"&gt;bib&lt;/a&gt;]: &lt;b&gt;Sebastian Pado; Michel Galley; Dan Jurafsky; Christopher D. Manning&lt;/b&gt;&lt;br /&gt;&lt;i&gt;Robust Machine Translation Evaluation with Entailment Features&lt;/i&gt;&lt;br /&gt;MT: Compared to human judgement Meteor does best (significantly better than Bleu) among shallow evaluation metrics.  Using RTE to see if the produced translation is an entailment or paraphrase of the reference does better.&lt;/p&gt;&lt;/li&gt;&lt;li&gt;&lt;p&gt;&lt;a href="http://aclweb.org/anthology-new/P/P09/P09-1039.pdf"&gt;P09-1039&lt;/a&gt; [&lt;a href="http://aclweb.org/anthology-new/P/P09/P09-1039.bib"&gt;bib&lt;/a&gt;]: &lt;b&gt;Andre Martins; Noah Smith; Eric Xing&lt;/b&gt;&lt;br /&gt;&lt;i&gt;Concise Integer Linear Programming Formulations for Dependency Parsing&lt;/i&gt;&lt;br /&gt;Syntax: Best paper award.&lt;br /&gt;&lt;/p&gt;&lt;/li&gt;&lt;li&gt;&lt;p&gt;&lt;a href="http://aclweb.org/anthology-new/P/P09/P09-1040.pdf"&gt;P09-1040&lt;/a&gt; [&lt;a href="http://aclweb.org/anthology-new/P/P09/P09-1040.bib"&gt;bib&lt;/a&gt;]: &lt;b&gt;Joakim Nivre&lt;/b&gt;&lt;br /&gt;&lt;i&gt;Non-Projective Dependency Parsing in Expected Linear Time&lt;/i&gt;&lt;br /&gt;Syntax: By adding one more operation that swaps tokens to the shift reduce parser, generation of nonprojective parses possible.&lt;br /&gt;&lt;/p&gt;&lt;/li&gt;&lt;li&gt;&lt;p&gt;&lt;a href="http://aclweb.org/anthology-new/P/P09/P09-1041.pdf"&gt;P09-1041&lt;/a&gt; [&lt;a href="http://aclweb.org/anthology-new/P/P09/P09-1041.bib"&gt;bib&lt;/a&gt;]: &lt;b&gt;Gregory Druck; Gideon Mann; Andrew McCallum&lt;/b&gt;&lt;br /&gt;&lt;i&gt;Semi-supervised Learning of Dependency Parsers using Generalized Expectation Criteria&lt;/i&gt;&lt;br /&gt;Syntax: Instead of labeled data, use expectation constraints in training parser.&lt;br /&gt;&lt;/p&gt;&lt;/li&gt;&lt;li&gt;&lt;p&gt;&lt;a href="http://aclweb.org/anthology-new/P/P09/P09-1042.pdf"&gt;P09-1042&lt;/a&gt; [&lt;a href="http://aclweb.org/anthology-new/P/P09/P09-1042.bib"&gt;bib&lt;/a&gt;]: &lt;b&gt;Kuzman Ganchev; Jennifer Gillenwater; Ben Taskar&lt;/b&gt;&lt;br /&gt;&lt;i&gt;Dependency Grammar Induction via Bitext Projection Constraints&lt;/i&gt;&lt;br /&gt;Syntax: Similar to above, but uses bitext constraints.&lt;/p&gt;&lt;/li&gt;&lt;li&gt;&lt;p&gt;&lt;a href="http://aclweb.org/anthology-new/P/P09/P09-1057.pdf"&gt;P09-1057&lt;/a&gt; [&lt;a href="http://aclweb.org/anthology-new/P/P09/P09-1057.bib"&gt;bib&lt;/a&gt;]: &lt;b&gt;Sujith Ravi; Kevin Knight&lt;/b&gt;&lt;br /&gt;&lt;i&gt;Minimized Models for Unsupervised Part-of-Speech Tagging&lt;/i&gt;&lt;br /&gt;Syntax: Best paper award.&lt;br /&gt;&lt;/p&gt;&lt;/li&gt;&lt;li&gt;&lt;p&gt;&lt;a href="http://aclweb.org/anthology-new/P/P09/P09-1068.pdf"&gt;P09-1068&lt;/a&gt; [&lt;a href="http://aclweb.org/anthology-new/P/P09/P09-1068.bib"&gt;bib&lt;/a&gt;]: &lt;b&gt;Nathanael Chambers; Dan Jurafsky&lt;/b&gt;&lt;br /&gt;&lt;i&gt;Unsupervised Learning of Narrative Schemas and their Participants&lt;/i&gt;&lt;br /&gt;Schemas: very nice work modeling structure of NYT stories.  Could be improved by focusing on a particular genre and introducing narrative ordering to model (apparently time ordering is really difficult).&lt;br /&gt;&lt;/p&gt;&lt;/li&gt;&lt;li&gt;&lt;p&gt;&lt;a href="http://aclweb.org/anthology-new/P/P09/P09-1070.pdf"&gt;P09-1070&lt;/a&gt; [&lt;a href="http://aclweb.org/anthology-new/P/P09/P09-1070.bib"&gt;bib&lt;/a&gt;]: &lt;b&gt;Joseph Reisinger; Marius Pasca&lt;/b&gt;&lt;br /&gt;&lt;i&gt;Latent Variable Models of Concept-Attribute Attachment&lt;/i&gt;&lt;br /&gt;SemRel: unsupervised learning of concept clusters and attributes for each cluster from text.&lt;br /&gt;&lt;/p&gt;&lt;/li&gt;&lt;li&gt;&lt;p&gt;&lt;a href="http://aclweb.org/anthology-new/P/P09/P09-1072.pdf"&gt;P09-1072&lt;/a&gt; [&lt;a href="http://aclweb.org/anthology-new/P/P09/P09-1072.bib"&gt;bib&lt;/a&gt;]: &lt;b&gt;Kai-min K. Chang; Vladimir L. Cherkassky; Tom M. Mitchell; Marcel Adam Just&lt;/b&gt;&lt;br /&gt;&lt;i&gt;Quantitative modeling of the neural representation of adjective-noun phrases to account for fMRI activation&lt;/i&gt;&lt;br /&gt;Brain: continuing the work of brain imaging.  Some success in guessing which adj-noun pair being thought.  Better questions can be asked.&lt;br /&gt;&lt;/p&gt;&lt;/li&gt;&lt;li&gt;&lt;p&gt;&lt;a href="http://aclweb.org/anthology-new/P/P09/P09-2062.pdf"&gt;P09-2062&lt;/a&gt; [&lt;a href="http://aclweb.org/anthology-new/P/P09/P09-2062.bib"&gt;bib&lt;/a&gt;]: &lt;b&gt;Chris Biemann; Monojit Choudhury; Animesh Mukherjee&lt;/b&gt;&lt;br /&gt;&lt;i&gt;Syntax is from Mars while Semantics from Venus! Insights from Spectral Analysis of Distributional Similarity Networks&lt;/i&gt;&lt;br /&gt;WSD: Qualitative differences between distributional similarity networks for semantics and syntax.  Does it say anything about word meaning representation?&lt;br /&gt;&lt;/p&gt;&lt;/li&gt;&lt;li&gt;&lt;p&gt;&lt;a href="http://aclweb.org/anthology-new/P/P09/P09-2059.pdf"&gt;P09-2059&lt;/a&gt; [&lt;a href="http://aclweb.org/anthology-new/P/P09/P09-2059.bib"&gt;bib&lt;/a&gt;]: &lt;b&gt;Gumwon Hong; Seung-Wook Lee; Hae-Chang Rim&lt;/b&gt;&lt;br /&gt;&lt;i&gt;Bridging Morpho-Syntactic Gap between Source and Target Sentences for English-Korean Statistical Machine Translation&lt;/i&gt;&lt;br /&gt;MT: Problems similar to Turkish.  Collins '05 proposed reordering.  Lee 06 removed useless function words.  Hong inserts pseudo-words to xlate to Korean morphemes.&lt;br /&gt;&lt;/p&gt;&lt;/li&gt;&lt;li&gt;&lt;p&gt;&lt;a href="http://aclweb.org/anthology-new/P/P09/P09-2069.pdf"&gt;P09-2069&lt;/a&gt; [&lt;a href="http://aclweb.org/anthology-new/P/P09/P09-2069.bib"&gt;bib&lt;/a&gt;]: &lt;b&gt;Haşim Sak; Tunga Güngör; Murat Saraçlar&lt;/b&gt;&lt;br /&gt;&lt;i&gt;A Stochastic Finite-State Morphological Parser for Turkish&lt;/i&gt;&lt;br /&gt;Mor: A probabilistic generative model for Turkish words.&lt;br /&gt;&lt;/p&gt;&lt;/li&gt;&lt;li&gt;&lt;p&gt;&lt;a href="http://aclweb.org/anthology-new/P/P09/P09-1076.pdf"&gt;P09-1076&lt;/a&gt; [&lt;a href="http://aclweb.org/anthology-new/P/P09/P09-1076.bib"&gt;bib&lt;/a&gt;]: &lt;b&gt;Bonnie Webber&lt;/b&gt;&lt;br /&gt;&lt;i&gt;Genre distinctions for discourse in the Penn TreeBank&lt;/i&gt;&lt;br /&gt;Invited talk - Discourse: topics seem relevant to Schema learning, should find a good tutorial.&lt;br /&gt;&lt;/p&gt;&lt;/li&gt;&lt;li&gt;&lt;p&gt;&lt;a href="http://aclweb.org/anthology-new/P/P09/P09-1087.pdf"&gt;P09-1087&lt;/a&gt; [&lt;a href="http://aclweb.org/anthology-new/P/P09/P09-1087.bib"&gt;bib&lt;/a&gt;]: &lt;b&gt;Michel Galley; Christopher D. Manning&lt;/b&gt;&lt;br /&gt;&lt;i&gt;Quadratic-Time Dependency Parsing for Machine Translation&lt;/i&gt;&lt;br /&gt;Syntax: nonprojective parsing tying each word to its most likely head.  Why did this not work when I tried it in CoNLL?  Gives O(n2).  Could you adopt Nivre for linear?  Unsupervised parsing?  Using dependency LM as a feature.&lt;br /&gt;&lt;/p&gt;&lt;/li&gt;&lt;li&gt;&lt;p&gt;&lt;a href="http://aclweb.org/anthology-new/P/P09/P09-1088.pdf"&gt;P09-1088&lt;/a&gt; [&lt;a href="http://aclweb.org/anthology-new/P/P09/P09-1088.bib"&gt;bib&lt;/a&gt;]: &lt;b&gt;Phil Blunsom; Trevor Cohn; Chris Dyer; Miles Osborne&lt;/b&gt;&lt;br /&gt;&lt;i&gt;A Gibbs Sampler for Phrasal Synchronous Grammar Induction&lt;/i&gt;&lt;br /&gt;MT: Bayesian magic.  Look into SCFGs.  Generates its own word alignment.  Works better on non-monotonic language pairs, monotonic ones difficult to improve on.&lt;br /&gt;&lt;/p&gt;&lt;/li&gt;&lt;li&gt;&lt;p&gt;&lt;a href="http://aclweb.org/anthology-new/P/P09/P09-1089.pdf"&gt;P09-1089&lt;/a&gt; [&lt;a href="http://aclweb.org/anthology-new/P/P09/P09-1089.bib"&gt;bib&lt;/a&gt;]: &lt;b&gt;Shachar Mirkin; Lucia Specia; Nicola Cancedda; Ido Dagan; Marc Dymetman; Idan Szpektor&lt;/b&gt;&lt;br /&gt;&lt;i&gt;Source-Language Entailment Modeling for Translating Unknown Terms&lt;/i&gt;&lt;br /&gt;MT: Generate paraphrases or entailments for unknown words using RTE.&lt;br /&gt;&lt;/p&gt;&lt;/li&gt;&lt;li&gt;&lt;p&gt;&lt;a href="http://aclweb.org/anthology-new/P/P09/P09-1090.pdf"&gt;P09-1090&lt;/a&gt; [&lt;a href="http://aclweb.org/anthology-new/P/P09/P09-1090.bib"&gt;bib&lt;/a&gt;]: &lt;b&gt;Ananthakrishnan Ramanathan; Hansraj Choudhary; Avishek Ghosh; Pushpak Bhattacharyya&lt;/b&gt;&lt;br /&gt;&lt;i&gt;Case markers and Morphology: Addressing the crux of the fluency problem in English-Hindi SMT&lt;/i&gt;&lt;br /&gt;MT: Reordering and factored model.  Fluency and adequacy manually evaluated in addition to BLEU.&lt;br /&gt;&lt;span style="text-decoration: underline;"&gt;&lt;/span&gt;&lt;/p&gt;&lt;/li&gt;&lt;li&gt;&lt;p&gt;&lt;a href="http://aclweb.org/anthology-new/P/P09/P09-1108.pdf"&gt;P09-1108&lt;/a&gt; [&lt;a href="http://aclweb.org/anthology-new/P/P09/P09-1108.bib"&gt;bib&lt;/a&gt;]: &lt;b&gt;Adam Pauls; Dan Klein&lt;/b&gt;&lt;br /&gt;&lt;i&gt;K-Best A* Parsing&lt;/i&gt;&lt;br /&gt;Syntax: Best paper award.&lt;br /&gt;&lt;/p&gt;&lt;/li&gt;&lt;li&gt;&lt;p&gt;&lt;a href="http://aclweb.org/anthology-new/P/P09/P09-1104.pdf"&gt;P09-1104&lt;/a&gt; [&lt;a href="http://aclweb.org/anthology-new/P/P09/P09-1104.bib"&gt;bib&lt;/a&gt;]: &lt;b&gt;Aria Haghighi; John Blitzer; John DeNero; Dan Klein&lt;/b&gt;&lt;br /&gt;&lt;i&gt;Better Word Alignments with Supervised ITG Models&lt;/i&gt;&lt;br /&gt;MT: Check if they have code available.  Claim 1.1 bleu improvement.&lt;br /&gt;&lt;/p&gt;&lt;/li&gt;&lt;li&gt;&lt;p&gt;&lt;a href="http://aclweb.org/anthology-new/P/P09/P09-1105.pdf"&gt;P09-1105&lt;/a&gt; [&lt;a href="http://aclweb.org/anthology-new/P/P09/P09-1105.bib"&gt;bib&lt;/a&gt;]: &lt;b&gt;Fei Huang&lt;/b&gt;&lt;br /&gt;&lt;i&gt;Confidence Measure for Word Alignment&lt;/i&gt;&lt;br /&gt;MT: Measure confidence based on posterior probability, improve alignments.&lt;br /&gt;&lt;/p&gt;&lt;/li&gt;&lt;li&gt;&lt;p&gt;&lt;a href="http://aclweb.org/anthology-new/P/P09/P09-1113.pdf"&gt;P09-1113&lt;/a&gt; [&lt;a href="http://aclweb.org/anthology-new/P/P09/P09-1113.bib"&gt;bib&lt;/a&gt;]: &lt;b&gt;Mike Mintz; Steven Bills; Rion Snow; Daniel Jurafsky&lt;/b&gt;&lt;br /&gt;&lt;i&gt;Distant supervision for relation extraction without labeled data&lt;/i&gt;&lt;br /&gt;SemRel: Unsupervised method.&lt;/p&gt;&lt;/li&gt;&lt;li&gt;&lt;p&gt;&lt;a href="http://aclweb.org/anthology-new/P/P09/P09-1116.pdf"&gt;P09-1116&lt;/a&gt; [&lt;a href="http://aclweb.org/anthology-new/P/P09/P09-1116.bib"&gt;bib&lt;/a&gt;]: &lt;b&gt;Dekang Lin; Xiaoyun Wu&lt;/b&gt;&lt;br /&gt;&lt;i&gt;Phrase Clustering for Discriminative Learning&lt;/i&gt;&lt;br /&gt;WSD: cluster phrases instead of words.  Much less ambiguous, so pure context.  Use different size clusters together, let the learning algorithm pick.  Similar to hierarchical.  Improves NER and query classification.  Any application where clustering words useful because of sparsity.  Clusters derived from 700B web data. Are the clusters available?&lt;br /&gt;&lt;/p&gt;&lt;/li&gt;&lt;li&gt;&lt;p&gt;&lt;a href="http://aclweb.org/anthology-new/P/P09/P09-1117.pdf"&gt;P09-1117&lt;/a&gt; [&lt;a href="http://aclweb.org/anthology-new/P/P09/P09-1117.bib"&gt;bib&lt;/a&gt;]: &lt;b&gt;Katrin Tomanek; Udo Hahn&lt;/b&gt;&lt;br /&gt;&lt;i&gt;Semi-Supervised Active Learning for Sequence Labeling&lt;/i&gt;&lt;br /&gt;ML: Self learning does not work because the instances with most confidence are not the useful ones.  Active learning asks for labels of instances with least confidence.  Boosting effect?&lt;/p&gt;&lt;/li&gt;&lt;li&gt;&lt;p&gt;&lt;a href="http://aclweb.org/anthology-new/D/D09/D09-1030.pdf"&gt;D09-1030&lt;/a&gt; [&lt;a href="http://aclweb.org/anthology-new/D/D09/D09-1030.bib"&gt;bib&lt;/a&gt;]: &lt;b&gt;Chris Callison-Burch&lt;/b&gt;&lt;br /&gt;&lt;i&gt;Fast, Cheap, and Creative: Evaluating Translation Quality Using Amazon’s Mechanical Turk&lt;/i&gt;&lt;br /&gt;MT: This article has one answer to the BLEU upper bound question among other things.  The following graph shows that professional humans still get higher Bleu compared to SMT systems (although this is using 10 reference translations).  They mention Google MT got higher Bleu but probably the test set was used in training.  Still gives relative performances.  Also, amazing things apparently can be done with Amazon Turk.  Should use them to judge Turkish alignment quality.&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://1.bp.blogspot.com/_306eXDjZh7g/SnrCtbjnWnI/AAAAAAAAAFk/c_rILJpHoxY/s1600-h/Picture1.png"&gt;&lt;img style="margin: 0px auto 10px; display: block; text-align: center; cursor: pointer; width: 320px; height: 130px;" src="http://1.bp.blogspot.com/_306eXDjZh7g/SnrCtbjnWnI/AAAAAAAAAFk/c_rILJpHoxY/s320/Picture1.png" alt="" id="BLOGGER_PHOTO_ID_5366815991712406130" border="0" /&gt;&lt;/a&gt;&lt;/p&gt;&lt;/li&gt;&lt;li&gt;&lt;p&gt;&lt;a href="http://aclweb.org/anthology-new/D/D09/D09-1045.pdf"&gt;D09-1045&lt;/a&gt; [&lt;a href="http://aclweb.org/anthology-new/D/D09/D09-1045.bib"&gt;bib&lt;/a&gt;]: &lt;b&gt;Jeff Mitchell; Mirella Lapata&lt;/b&gt;&lt;br /&gt;&lt;i&gt;Language Models Based on Semantic Composition&lt;/i&gt;&lt;br /&gt;LM: Using simple VSM model for semantics small improvement over trigrams.&lt;br /&gt;&lt;/p&gt;&lt;/li&gt;&lt;li&gt;&lt;p&gt;&lt;a href="http://aclweb.org/anthology-new/W/W09/W09-2504.pdf"&gt;W09-2504&lt;/a&gt; [&lt;a href="http://aclweb.org/anthology-new/W/W09/W09-2504.bib"&gt;bib&lt;/a&gt;]: &lt;b&gt;Idan Szpektor; Ido Dagan&lt;/b&gt;&lt;br /&gt;&lt;i&gt;Augmenting WordNet-based Inference with Argument Mapping&lt;/i&gt;&lt;br /&gt;RTE: Some lexical substitutions require other words to be shuffled.  Automatic learning of shuffling rules using DIRT.&lt;br /&gt;&lt;/p&gt;&lt;/li&gt;&lt;li&gt;&lt;p&gt;&lt;a href="http://aclweb.org/anthology-new/W/W09/W09-2506.pdf"&gt;W09-2506&lt;/a&gt; [&lt;a href="http://aclweb.org/anthology-new/W/W09/W09-2506.bib"&gt;bib&lt;/a&gt;]: &lt;b&gt;Stefan Thater; Georgiana Dinu; Manfred Pinkal&lt;/b&gt;&lt;br /&gt;&lt;i&gt;Ranking Paraphrases in Context&lt;/i&gt;&lt;br /&gt;WSD: Using lexsub dataset.  No dictionary (I think).  VSM semantic representation.  Check Mitchell&amp;amp;Lapata, Erk&amp;amp;Pado for prior work.&lt;br /&gt;&lt;/p&gt;&lt;/li&gt;&lt;li&gt;&lt;p&gt;&lt;a href="http://aclweb.org/anthology-new/W/W09/W09-2507.pdf"&gt;W09-2507&lt;/a&gt; [&lt;a href="http://aclweb.org/anthology-new/W/W09/W09-2507.bib"&gt;bib&lt;/a&gt;]: &lt;b&gt;Kirk Roberts&lt;/b&gt;&lt;br /&gt;&lt;i&gt;Building an Annotated Textual Inference Corpus for Motion and Space&lt;/i&gt;&lt;/p&gt;&lt;/li&gt;&lt;li&gt;&lt;p&gt;&lt;a href="http://aclweb.org/anthology-new/W/W09/W09-2510.pdf"&gt;W09-2510&lt;/a&gt; [&lt;a href="http://aclweb.org/anthology-new/W/W09/W09-2510.bib"&gt;bib&lt;/a&gt;]: &lt;b&gt;David Clausen; Christopher D. Manning&lt;/b&gt;&lt;br /&gt;&lt;i&gt;Presupposed Content and Entailments in Natural Language Inference&lt;/i&gt;&lt;br /&gt;RTE: Example: "Mary lied about buying a car" -&gt; Mary did not buy a car.  "Mary regretted buying a car" -&gt; Mary bought a car.  "Mary thought about buying a car" -&gt; Uncertain.  Kartunnen 1975 presupposition projection.  Check out NatLog system (natural logic).&lt;/p&gt;&lt;/li&gt;&lt;li&gt;&lt;p&gt;&lt;a href="http://aclweb.org/anthology-new/D/D09/D09-1058.pdf"&gt;D09-1058&lt;/a&gt; [&lt;a href="http://aclweb.org/anthology-new/D/D09/D09-1058.bib"&gt;bib&lt;/a&gt;]: &lt;b&gt;Jun Suzuki; Hideki Isozaki; Xavier Carreras; Michael Collins&lt;/b&gt;&lt;br /&gt;&lt;i&gt;An Empirical Study of Semi-supervised Structured Conditional Models for Dependency Parsing&lt;/i&gt;&lt;br /&gt;Syntax: Take a look at earlier model in Suzuki, ACL'08.  What is with the q function?  Other work building on McDonald: Carreras '07, Koo '08.  MIRA training.&lt;br /&gt;&lt;/p&gt;&lt;/li&gt;&lt;li&gt;&lt;p&gt;&lt;a href="http://aclweb.org/anthology-new/D/D09/D09-1059.pdf"&gt;D09-1059&lt;/a&gt; [&lt;a href="http://aclweb.org/anthology-new/D/D09/D09-1059.bib"&gt;bib&lt;/a&gt;]: &lt;b&gt;Richard Johansson&lt;/b&gt;&lt;br /&gt;&lt;i&gt;Statistical Bistratal Dependency Parsing&lt;/i&gt;&lt;br /&gt;Syntax: Trying simultaneous parsing/SRL with joint probabilistic model.&lt;br /&gt;&lt;/p&gt;&lt;/li&gt;&lt;li&gt;&lt;p&gt;&lt;a href="http://aclweb.org/anthology-new/D/D09/D09-1060.pdf"&gt;D09-1060&lt;/a&gt; [&lt;a href="http://aclweb.org/anthology-new/D/D09/D09-1060.bib"&gt;bib&lt;/a&gt;]: &lt;b&gt;Wenliang Chen; Jun’ichi Kazama; Kiyotaka Uchimoto; Kentaro Torisawa&lt;/b&gt;&lt;br /&gt;&lt;i&gt;Improving Dependency Parsing with Subtrees from Auto-Parsed Data&lt;/i&gt;&lt;br /&gt;Syntax: Self training, SSL for parser.  Improvement, even though confidence in unlabeled text not well represented.  Best system does 46% of the sentences completely correct (unlabeled).&lt;br /&gt;&lt;/p&gt;&lt;/li&gt;&lt;li&gt;&lt;p&gt;&lt;a href="http://aclweb.org/anthology-new/D/D09/D09-1065.pdf"&gt;D09-1065&lt;/a&gt; [&lt;a href="http://aclweb.org/anthology-new/D/D09/D09-1065.bib"&gt;bib&lt;/a&gt;]: &lt;b&gt;Brian Murphy; Marco Baroni; Massimo Poesio&lt;/b&gt;&lt;br /&gt;&lt;i&gt;EEG responds to conceptual stimuli and corpus semantics&lt;/i&gt;&lt;br /&gt;Brain: Using EEG instead of fMRI in Mitchell style work. Why doesn't anybody try: (1) verbs, (2) grammaticality, (3) lie/truth, (4) agree/disagree, (5) complex grammatical constructs.&lt;br /&gt;&lt;/p&gt;&lt;/li&gt;&lt;li&gt;&lt;p&gt;&lt;a href="http://aclweb.org/anthology-new/D/D09/D09-1070.pdf"&gt;D09-1070&lt;/a&gt; [&lt;a href="http://aclweb.org/anthology-new/D/D09/D09-1070.bib"&gt;bib&lt;/a&gt;]: &lt;b&gt;Taesun Moon; Katrin Erk; Jason Baldridge&lt;/b&gt;&lt;br /&gt;&lt;i&gt;Unsupervised morphological segmentation and clustering with document boundaries&lt;/i&gt;&lt;br /&gt;Mor: help unsupervised morphology by assuming same stem more likely to appear in same document.&lt;br /&gt;&lt;/p&gt;&lt;/li&gt;&lt;li&gt;&lt;p&gt;&lt;a href="http://aclweb.org/anthology-new/D/D09/D09-1071.pdf"&gt;D09-1071&lt;/a&gt; [&lt;a href="http://aclweb.org/anthology-new/D/D09/D09-1071.bib"&gt;bib&lt;/a&gt;]: &lt;b&gt;Jurgen Van Gael; Andreas Vlachos; Zoubin Ghahramani&lt;/b&gt;&lt;br /&gt;&lt;i&gt;The infinite HMM for unsupervised PoS tagging&lt;/i&gt;&lt;br /&gt;Syntax: Use npbayes to pick the number of HMM states.  Directly use learnt HMM states rather than trying to map them to existing tagset.&lt;br /&gt;&lt;/p&gt;&lt;/li&gt;&lt;li&gt;&lt;p&gt;&lt;a href="http://aclweb.org/anthology-new/D/D09/D09-1072.pdf"&gt;D09-1072&lt;/a&gt; [&lt;a href="http://aclweb.org/anthology-new/D/D09/D09-1072.bib"&gt;bib&lt;/a&gt;]: &lt;b&gt;Qiuye Zhao; Mitch Marcus&lt;/b&gt;&lt;br /&gt;&lt;i&gt;A Simple Unsupervised Learner for POS Disambiguation Rules Given Only a Minimal Lexicon&lt;/i&gt;&lt;/p&gt;&lt;/li&gt;&lt;li&gt;&lt;p&gt;&lt;a href="http://aclweb.org/anthology-new/D/D09/D09-1085.pdf"&gt;D09-1085&lt;/a&gt; [&lt;a href="http://aclweb.org/anthology-new/D/D09/D09-1085.bib"&gt;bib&lt;/a&gt;]: &lt;b&gt;Laura Rimell; Stephen Clark; Mark Steedman&lt;/b&gt;&lt;br /&gt;&lt;i&gt;Unbounded Dependency Recovery for Parser Evaluation&lt;/i&gt;&lt;br /&gt;Syntax: same motivation as Onder's work.  Focuses on a particular construct difficult for parsers (accuracy &lt; 50%) and builds a test set.  Same problem in many fields (infrequent senses ignored in WSD, rare issues ignored in RTE/Semantics, rare constructs ignored in syntax, etc. etc.)&lt;br /&gt;&lt;/p&gt;&lt;/li&gt;&lt;li&gt;&lt;p&gt;&lt;a href="http://aclweb.org/anthology-new/D/D09/D09-1086.pdf"&gt;D09-1086&lt;/a&gt; [&lt;a href="http://aclweb.org/anthology-new/D/D09/D09-1086.bib"&gt;bib&lt;/a&gt;]: &lt;b&gt;David A. Smith; Jason Eisner&lt;/b&gt;&lt;br /&gt;&lt;i&gt;Parser Adaptation and Projection with Quasi-Synchronous Grammar Features&lt;/i&gt;&lt;br /&gt;Syntax: learn mapping between parsers with different output styles (e.g. how they connect auxiliary verbs).&lt;br /&gt;&lt;/p&gt;&lt;/li&gt;&lt;li&gt;&lt;p&gt;&lt;a href="http://aclweb.org/anthology-new/D/D09/D09-1087.pdf"&gt;D09-1087&lt;/a&gt; [&lt;a href="http://aclweb.org/anthology-new/D/D09/D09-1087.bib"&gt;bib&lt;/a&gt;]: &lt;b&gt;Zhongqiang Huang; Mary Harper&lt;/b&gt;&lt;br /&gt;&lt;i&gt;Self-Training PCFG Grammars with Latent Annotations Across Languages&lt;/i&gt;&lt;br /&gt;Syntax.&lt;br /&gt;&lt;/p&gt;&lt;/li&gt;&lt;li&gt;&lt;p&gt;&lt;a href="http://aclweb.org/anthology-new/D/D09/D09-1088.pdf"&gt;D09-1088&lt;/a&gt; [&lt;a href="http://aclweb.org/anthology-new/D/D09/D09-1088.bib"&gt;bib&lt;/a&gt;]: &lt;b&gt;Reut Tsarfaty; Khalil Sima’an; Remko Scha&lt;/b&gt;&lt;br /&gt;&lt;i&gt;An Alternative to Head-Driven Approaches for Parsing a (Relatively) Free Word-Order Language&lt;/i&gt;&lt;br /&gt;Syntax: Separate ordering information to get better coefficient stats in parser learning.  Many issues same as Turkish.&lt;br /&gt;&lt;/p&gt;&lt;/li&gt;&lt;li&gt;&lt;p&gt;&lt;a href="http://aclweb.org/anthology-new/D/D09/D09-1105.pdf"&gt;D09-1105&lt;/a&gt; [&lt;a href="http://aclweb.org/anthology-new/D/D09/D09-1105.bib"&gt;bib&lt;/a&gt;]: &lt;b&gt;Roy Tromble; Jason Eisner&lt;/b&gt;&lt;br /&gt;&lt;i&gt;Learning Linear Ordering Problems for Better Translation&lt;/i&gt;&lt;br /&gt;MT: Approximate solution to reordering problem for MT.  Shows improvement.  Does not make use of parse tree.&lt;br /&gt;&lt;/p&gt;&lt;/li&gt;&lt;li&gt;&lt;p&gt;&lt;a href="http://aclweb.org/anthology-new/D/D09/D09-1106.pdf"&gt;D09-1106&lt;/a&gt; [&lt;a href="http://aclweb.org/anthology-new/D/D09/D09-1106.bib"&gt;bib&lt;/a&gt;]: &lt;b&gt;Yang Liu; Tian Xia; Xinyan Xiao; Qun Liu&lt;/b&gt;&lt;br /&gt;&lt;i&gt;Weighted Alignment Matrices for Statistical Machine Translation&lt;/i&gt;&lt;br /&gt;MT: Compact representation for an alignment distribution.  Similar to forest for trees or lattice for segmentations.&lt;br /&gt;&lt;/p&gt;&lt;/li&gt;&lt;li&gt;&lt;p&gt;&lt;a href="http://aclweb.org/anthology-new/D/D09/D09-1107.pdf"&gt;D09-1107&lt;/a&gt; [&lt;a href="http://aclweb.org/anthology-new/D/D09/D09-1107.bib"&gt;bib&lt;/a&gt;]: &lt;b&gt;Matti Kääriäinen&lt;/b&gt;&lt;br /&gt;&lt;i&gt;Sinuhe – Statistical Machine Translation using a Globally Trained Conditional Exponential Family Translation Model&lt;/i&gt;&lt;br /&gt;MT: New MT engine based on structured learning.  Faster than moses with better TM scores, but overall lower BLEU.&lt;br /&gt;&lt;/p&gt;&lt;/li&gt;&lt;li&gt;&lt;p&gt;&lt;a href="http://aclweb.org/anthology-new/D/D09/D09-1108.pdf"&gt;D09-1108&lt;/a&gt; [&lt;a href="http://aclweb.org/anthology-new/D/D09/D09-1108.bib"&gt;bib&lt;/a&gt;]: &lt;b&gt;Hui Zhang; Min Zhang; Haizhou Li; Chew Lim Tan&lt;/b&gt;&lt;br /&gt;&lt;i&gt;Fast Translation Rule Matching for Syntax-based Statistical Machine Translation&lt;/i&gt;&lt;br /&gt;MT: Compact representation with fast search for packed forests.&lt;br /&gt;&lt;/p&gt;&lt;/li&gt;&lt;/ul&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-1214261912916945968?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='related' href='http://www.acl-ijcnlp-2009.org/' title='ACL 2009 Notes'/><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/1214261912916945968/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=1214261912916945968' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/1214261912916945968'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/1214261912916945968'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2009/08/acl-2009-notes.html' title='ACL 2009 Notes'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://1.bp.blogspot.com/_306eXDjZh7g/SnrCtbjnWnI/AAAAAAAAAFk/c_rILJpHoxY/s72-c/Picture1.png' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-767521694042949009</id><published>2009-08-04T12:31:00.002+03:00</published><updated>2010-11-03T08:27:27.306+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Publications'/><title type='text'>Modeling Morphologically Rich Languages Using Split Words and Unstructured Dependencies</title><content type='html'>Deniz Yuret and Ergun Bicici.  In &lt;i&gt;the Joint conference of the 47th Annual Meeting of the Association for Computational Linguistics and the 4th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing (ACL-IJCNLP 2009)&lt;/i&gt; (&lt;a href="http://aclweb.org/anthology-new/P/P09/P09-2087.pdf"&gt;PDF&lt;/a&gt;).&lt;br /&gt;&lt;span class="fullpost"&gt;&lt;br /&gt;&lt;iframe src="http://docs.google.com/present/embed?id=d2jm3f3_1002t6xp6dgc" frameborder="0" width="410" height="342"&gt;&lt;/iframe&gt;&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Abstract:&lt;/b&gt;  We experiment with splitting words into their stem and sufﬁx components for modeling morphologically rich languages. We show that using a morphological analyzer and disambiguator results in a signiﬁcant perplexity reduction in Turkish.   We present ﬂexible n-gram models, Flex-Grams, which assume that the n−1 tokens that determine the probability of a given token can be chosen anywhere in the sentence rather than the preceding n − 1 positions. Our ﬁnal model achieves 27% perplexity reduction compared to the standard n-gram model.&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-767521694042949009?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='related' href='http://aclweb.org/anthology-new/P/P09/P09-2087.pdf' title='Modeling Morphologically Rich Languages Using Split Words and Unstructured Dependencies'/><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/767521694042949009/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=767521694042949009' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/767521694042949009'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/767521694042949009'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2009/08/modeling-morphologically-rich-languages.html' title='Modeling Morphologically Rich Languages Using Split Words and Unstructured Dependencies'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-117563967510190365</id><published>2009-07-22T04:31:00.001+03:00</published><updated>2011-09-22T19:08:57.423+03:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Publications'/><title type='text'>Morphological cues vs. number of nominals in learning verb types in Turkish: The syntactic bootstrapping mechanism revisited</title><content type='html'>A. Engin Ural;  Deniz Yuret;  F. Nihan Ketrez;  Dilara Koçbaş; Aylin C. Küntay. &lt;i&gt;Language and Cognitive Processes, 24(10), pp. 1393-1405, December 2009&lt;/i&gt; (&lt;a href="https://docs.google.com/viewer?a=v&amp;pid=explorer&amp;chrome=true&amp;srcid=0B6C4-zOYlkxsZjMxZDE0MWItNmI3Yy00YWZlLWEzYzEtZTZhYTNlYjc3ZDI5&amp;hl=en"&gt;PDF&lt;/a&gt;, &lt;a href="http://www.informaworld.com/smpp/ftinterface~content=a913314122~fulltext=713240930"&gt;PDF&lt;/a&gt;, &lt;a href="http://www.informaworld.com/smpp/ftinterface~content=a913314122~fulltext=713240928"&gt;HTML&lt;/a&gt;).&lt;span class="fullpost"&gt;&lt;br /&gt;&lt;br /&gt;Abstract:  The syntactic bootstrapping mechanism of verb learning was evaluated against child-directed speech in Turkish, a language with rich morphology, nominal ellipsis and free word order. Machine-learning algorithms were run on transcribed caregiver speech directed to two Turkish learners (one hour every two weeks between 0;9 to 1;10) of different socioeconomic backgrounds. We found that the number of nominals in child-directed utterances plays a small, but significant, role in classifying transitive and intransitive verbs. Further, we found that accusative morphology on the noun is a strong cue in clustering verb types. We also found that verbal morphology (past tense and bareness of verbs) is useful in distinguishing between different subtypes of intransitive verbs. These results suggest that syntactic bootstrapping mechanisms should be extended to include morphological cues to verb learning in morphologically rich languages.&lt;br /&gt;&lt;br /&gt;Keywords: Language development; Turkish; Child-directed speech, Syntactic bootstrapping; Morphology &lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-117563967510190365?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='related' href='http://www.informaworld.com/smpp/content~db=all~content=a913314122' title='Morphological cues vs. number of nominals in learning verb types in Turkish: The syntactic bootstrapping mechanism revisited'/><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/117563967510190365/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=117563967510190365' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/117563967510190365'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/117563967510190365'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2009/07/morphological-cues-vs-number-of.html' title='Morphological cues vs. number of nominals in learning verb types in Turkish: The syntactic bootstrapping mechanism revisited'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-3164488552455901646</id><published>2009-04-29T10:59:00.002+03:00</published><updated>2009-11-04T17:21:25.718+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Links'/><title type='text'>Natural Language Processing summer course at Sabanci University</title><content type='html'>This summer Kemal Oflazer, Dilek Hakkani-Tur ve Gokhan Tur are offering a Statistical Natural Language Processing course at Sabanci University.  A draft syllabus is included below.&lt;br /&gt;&lt;span class="fullpost"&gt;&lt;br /&gt;&lt;b&gt;&lt;div&gt;STATISTICAL NLP CLASS:&lt;/div&gt;&lt;/b&gt; &lt;br /&gt;&lt;br /&gt;&lt;div&gt;&lt;b&gt;Kemal Oflazer:&lt;/b&gt;&lt;/div&gt; &lt;div&gt;&lt;ul style="background-color: rgb(255, 255, 255);"&gt;&lt;li&gt;Overview of NLP (2 hours)&lt;/li&gt;&lt;ul&gt;&lt;li&gt;NLP Applications&lt;/li&gt;&lt;li&gt;Processing pipeline: Basic steps and how they feed into each other and how they are used by applications&lt;/li&gt;&lt;/ul&gt;&lt;/ul&gt;&lt;ul style="background-color: rgb(255, 255, 255);"&gt;&lt;li&gt; Morphological Analysis (could be skipped or shortened) (2 hours)&lt;/li&gt;&lt;/ul&gt;&lt;ul style="background-color: rgb(255, 255, 255);"&gt;&lt;li&gt;Introduction to Statistical Models, n-gram language modeling, (2hours)&lt;/li&gt;&lt;ul&gt;&lt;li&gt;Applications to simple sequence problems (tagging English and/or deascifier)&lt;/li&gt;&lt;/ul&gt;&lt;/ul&gt;&lt;ul&gt;&lt;li&gt;Morphological Disambiguation (applications to Turkish)&lt;/li&gt;&lt;/ul&gt;&lt;ul style="background-color: rgb(255, 255, 255);"&gt;&lt;li&gt;  HMMs (formal treatment (backward-forward + viterbi) + applications to tagging) (2-3 hours)&lt;/li&gt;&lt;/ul&gt;&lt;/div&gt; &lt;ul&gt;&lt;li style="background-color: rgb(255, 255, 255);"&gt;CFGs and Probabilistic CFGs (3-4 hours)&lt;/li&gt;&lt;ul style="background-color: rgb(255, 255, 255);"&gt;&lt;li&gt;Inside-outside algorithm for training PCFGs&lt;/li&gt;&lt;li&gt;Parsing with PCFGs&lt;/li&gt;&lt;/ul&gt;&lt;li style="background-color: rgb(255, 255, 255);"&gt;Machine Translation (MT) (3-4 Hours)&lt;/li&gt;&lt;ul&gt;&lt;li style="background-color: rgb(255, 255, 255);"&gt;Brief overview Classical Symbolic MT&lt;/li&gt;&lt;li style="background-color: rgb(255, 255, 255);"&gt;  Statistical Machine Translation&lt;/li&gt;&lt;ul style="background-color: rgb(255, 255, 255);"&gt;&lt;li&gt;Word-based Models&lt;/li&gt;&lt;li&gt;Phrase-based Models&lt;/li&gt;&lt;li&gt;Syntax-based models&lt;/li&gt;&lt;/ul&gt;&lt;li&gt;&lt;span style="background-color: rgb(255, 255, 255);"&gt;Dealing with Morphology in SMT&lt;/span&gt;&lt;/li&gt;&lt;/ul&gt;&lt;/ul&gt; &lt;div&gt;  &lt;/div&gt; &lt;br /&gt;&lt;br /&gt;&lt;div&gt;&lt;b&gt;Dilek Hakkani-Tur:&lt;/b&gt;&lt;/div&gt; &lt;div&gt; &lt;/div&gt; &lt;ul&gt;&lt;div&gt; &lt;li&gt;Elements of Information Theory / Advanced Language Modeling and Applications&lt;br /&gt;&lt;/li&gt;&lt;/div&gt;&lt;ul&gt;&lt;li&gt;Entropy/Perplexity/Mutual Information&lt;/li&gt;&lt;li&gt;Noisy Channel Model&lt;/li&gt;&lt;ul&gt;&lt;li&gt;Sequence classification / HMM&lt;/li&gt;&lt;li&gt;Sample classification / Naive Bayes&lt;/li&gt;&lt;/ul&gt;&lt;li&gt;Smoothing&lt;/li&gt;&lt;li&gt; &lt;div&gt;Adaptation&lt;/div&gt;&lt;/li&gt;&lt;/ul&gt;&lt;div&gt; &lt;li&gt;Named Entity Extraction (NE)&lt;/li&gt;&lt;/div&gt;&lt;ul&gt;&lt;li&gt;Using HMM for NE&lt;/li&gt;&lt;/ul&gt;&lt;ul&gt;&lt;li&gt;Using CRF for NE&lt;/li&gt;&lt;li&gt;Using Boosting/MaxEnt/SVM for NE&lt;/li&gt;&lt;/ul&gt;&lt;li&gt;Spoken Language Understanding (SLU) as Template Filling&lt;/li&gt;&lt;ul&gt;&lt;li&gt; &lt;div&gt;HMM approaches (AT&amp;amp;T vs BBN)&lt;/div&gt;&lt;/li&gt;&lt;li&gt; &lt;div&gt;Hidden Vector State Models&lt;/div&gt;&lt;/li&gt;&lt;li&gt; &lt;div&gt;Latent Semantic Analysis&lt;/div&gt;&lt;/li&gt;&lt;li&gt; &lt;div&gt;Sample-classification based (Boosting/MaxEnt/Decision Trees)&lt;/div&gt;&lt;/li&gt;&lt;/ul&gt;&lt;li&gt; &lt;div&gt;Summarization &lt;/div&gt;&lt;/li&gt;&lt;ul&gt;&lt;li&gt;Greedy Algorithms, MMR&lt;/li&gt;&lt;li&gt;TextRank/LexRank&lt;/li&gt;&lt;li&gt;Classification based extractive summarization&lt;/li&gt;&lt;li&gt;Global Models for Summarization: Linear Programming approaches&lt;/li&gt;&lt;/ul&gt;&lt;li&gt; &lt;div&gt;Question Answering&lt;/div&gt;&lt;/li&gt;&lt;li&gt; &lt;div&gt;Spoken Dialog Systems and Dialog Management (DM) &lt;/div&gt;&lt;/li&gt;&lt;ul&gt;&lt;li&gt; &lt;div&gt;Dialog Systems&lt;/div&gt;&lt;/li&gt;&lt;li&gt; &lt;div&gt;DM&lt;/div&gt;&lt;/li&gt;&lt;ul&gt;&lt;li&gt; &lt;div&gt;Finite State Models&lt;/div&gt;&lt;/li&gt;&lt;li&gt; &lt;div&gt;Agent Models&lt;/div&gt;&lt;/li&gt;&lt;li&gt; &lt;div&gt;Reinforcement Learning&lt;/div&gt;&lt;/li&gt;&lt;/ul&gt;&lt;/ul&gt;&lt;/ul&gt; &lt;div&gt; &lt;/div&gt; &lt;br /&gt;&lt;br /&gt;&lt;div&gt;&lt;b&gt;Gokhan Tur&lt;/b&gt;&lt;/div&gt; &lt;div&gt;    &lt;/div&gt; &lt;ul&gt;&lt;li&gt; &lt;div&gt;Topic Classification&lt;/div&gt;&lt;/li&gt;&lt;ul&gt;&lt;li&gt; &lt;div&gt;Discriminative classification: SVM/Boosting&lt;/div&gt;&lt;/li&gt;&lt;li&gt; &lt;div&gt;Generative classification: language model, document similarity, vector-space-model&lt;/div&gt;&lt;/li&gt;&lt;li&gt; &lt;div&gt;Feature selection/transformation (LDA)&lt;/div&gt;&lt;/li&gt;&lt;li&gt; &lt;div&gt;Latent semantic indexing&lt;/div&gt;&lt;/li&gt;&lt;/ul&gt;&lt;li&gt; &lt;div&gt;SLU as Intent Determination &lt;/div&gt;&lt;/li&gt;&lt;ul&gt;&lt;li&gt; &lt;div&gt;Semantic Role Labeling&lt;/div&gt;&lt;/li&gt;&lt;li&gt; &lt;div&gt;Robustness to ASR&lt;/div&gt;&lt;/li&gt;&lt;/ul&gt;&lt;li&gt; &lt;div&gt;Topic Clustering&lt;/div&gt;&lt;/li&gt;&lt;ul&gt;&lt;li&gt; &lt;div&gt;K-Means&lt;/div&gt;&lt;/li&gt;&lt;li&gt; &lt;div&gt;Top/Down vs. Bottom/Up&lt;/div&gt;&lt;/li&gt;&lt;/ul&gt;&lt;li&gt; &lt;div&gt;Topic Segmentation&lt;/div&gt;&lt;/li&gt;&lt;ul&gt;&lt;li&gt; &lt;div&gt;HMM&lt;/div&gt;&lt;/li&gt;&lt;li&gt; &lt;div&gt;TextTiling&lt;/div&gt;&lt;/li&gt;&lt;li&gt; &lt;div&gt;Markov Chains&lt;/div&gt;&lt;/li&gt;&lt;/ul&gt;&lt;li&gt; &lt;div&gt;Sentence Segmentation&lt;/div&gt;&lt;/li&gt;&lt;ul&gt;&lt;li&gt; &lt;div&gt;HMM&lt;/div&gt;&lt;/li&gt;&lt;li&gt; &lt;div&gt;CRF&lt;/div&gt;&lt;/li&gt;&lt;li&gt; &lt;div&gt;Hybrid&lt;/div&gt;&lt;/li&gt;&lt;/ul&gt;&lt;li&gt;Active Learning/Semi-Supervised Learning/Unsupervised Learning/Model Adaptation/Robustness&lt;/li&gt;&lt;/ul&gt;&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-3164488552455901646?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/3164488552455901646/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=3164488552455901646' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/3164488552455901646'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/3164488552455901646'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2009/04/natural-language-processing-summer.html' title='Natural Language Processing summer course at Sabanci University'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-111964927104730893</id><published>2009-04-26T12:33:00.000+03:00</published><updated>2010-11-03T09:08:54.396+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Math'/><title type='text'>Deniz Yuret's Math Problems</title><content type='html'>This is a list of elementary problems I like for one reason or another from various branches of mathematics.  I cited the people I heard the problems from, they are not necessarily the originators. The unsolved flag just means that the problem is yet unsolved by me. Send me a solution if you find one.  You can send me any interesting problems you find by adding a comment to this post.  Also check out my other &lt;a href="http://denizyuret.blogspot.com/search/label/Math%20Problems"&gt;math posts&lt;/a&gt;, especially &lt;a href="http://denizyuret.blogspot.com/2005/06/probability-twisters.html"&gt;probability twisters&lt;/a&gt;, &lt;a href="http://denizyuret.blogspot.com/2005/06/unsolved-problems.html"&gt;unsolved elementary problems&lt;/a&gt;, and &lt;a href="http://denizyuret.blogspot.com/2005/06/math-links.html"&gt;links to other math sites&lt;/a&gt;.  &lt;i&gt;Last update: April 26, 2009.&lt;/i&gt;&lt;span class="fullpost"&gt;&lt;br /&gt;&lt;br /&gt;&lt;ul&gt;&lt;li&gt; (Ahmed Roman) Nathan and Abi are playing a game. Abi always goes first. The players take turns changing a positive integer to a smaller one and then passing the smaller number back to their opponent. On each move, a player may either subtract one from the integer or halve it, rounding down if necessary. Thus, from 28 the legal moves are to 27 or to 14; from 27, the legal moves are to 26 or to 13. The game ends when the integer reaches 0. The player who makes the last move wins. For example, if the starting integer is 15, Abi might move to 7, Nathan to 6, Abi to 3, Nathan to 2, Abi to 1, and now Nathan moves to 0 and wins. (However, in this sample game Abi could have played better!)&lt;br /&gt;&lt;ol&gt;&lt;li&gt;Assuming both Nathan and Abi play according to the best possible strategy, who will win if the starting integer is 1000? 2000? Prove your answer.&lt;br /&gt;&lt;/li&gt;&lt;li&gt;As you might expect, for some starting integers Abi will win and for others Nathan will win. If we pick a starting integer at random from all the integers from 1 to n inclusive, we can consider the probability of Nathan winning. This probability will fluctuate as n increases, but what is its limit as n tends to infinity? Prove your answer.&lt;br /&gt;&lt;/li&gt;&lt;/ol&gt;&lt;/li&gt;&lt;li&gt; (Serdar Tasiran) We have an n boxes, k of them have a ball in them (k&lt;=n), the others are empty.  We start opening the boxes in a random order and stop when we find a ball.  What is the expected number of boxes we will open?&lt;br /&gt;&lt;/li&gt;&lt;li&gt; (Ertem Esiner) Two mathematicians meet an old friend who has two kids and ask her what their ages are.  The mother writes the sum of the two ages on a piece of paper and gives it to the first mathematician, and gives the product of the two ages to the second mathematician.  The mathematicians think for a minute and claim the information is not sufficient.  The woman says "think again".  At that moment the mathematician with the product figures out the answer.  How old are the two kids?&lt;br /&gt;&lt;/li&gt;&lt;li&gt; (Ertem Esiner) Five pirates are trying to divide up 1000 gold pieces among themselves.  They decide to take turns making offers.  An offer by a pirate is accepted if at least half of the other pirates agree to it.  Otherwise the pirate making the offer is killed, and the next one makes an offer.  Each pirate is greedy, but none wants to die.  If you are the first pirate, what offer do you make?&lt;br /&gt;&lt;/li&gt;&lt;li&gt; (Drake) A professor announces that there will be surprise exam the following week, specifically that the students will not know what day the exam is going to take place.  The students reason that the exam cannot take place on Friday, because by Thursday night they would know what day the exam is going to take place.  If the exam cannot be on Friday it cannot be on Thursday either, because they would know by Wednesday night and so on.  They finally decide the exam cannot take place and forget to study.  The professor gives the exam on Wednesday to everybody's surprise.  What went wrong?&lt;br /&gt;&lt;/li&gt;&lt;li&gt; (Mackay) Fred rolls an unbiased six-sided die once per second, noting the occasions when the outcome is a six.&lt;br /&gt;&lt;ol&gt; &lt;li&gt; What is the mean number of rolls from one six to the next six?&lt;br /&gt;&lt;/li&gt;&lt;li&gt; Between two rolls, the clock strikes one. What is the mean number of rolls until the next six?&lt;br /&gt;&lt;/li&gt;&lt;li&gt; Now think back before the clock struck. What is the mean number of rolls, going back in time, until the most recent six?&lt;br /&gt;&lt;/li&gt;&lt;li&gt; What is the mean number of rolls from the six before the clock struck to the next six?&lt;br /&gt;&lt;/li&gt;&lt;li&gt; Is your first answer different from your last answer?  Explain.&lt;br /&gt;&lt;/li&gt;&lt;/ol&gt;&lt;/li&gt;&lt;li&gt; (Deniz) You have two independent random variables between 0 and 1.  How do you decide which one is more likely to be larger than the other?&lt;br /&gt;&lt;/li&gt;&lt;li&gt; (Deniz) You have two arbitrary random variables between 0 and 1.  How do you decide if they are independent or not looking at their joint pdf density plot?&lt;br /&gt;&lt;/li&gt;&lt;li&gt; (Dennis Eriksson) Find all solutions to the diophantine equation: 1+2+3+...+n=m^2, where n and m are positive integers.&lt;br /&gt;&lt;/li&gt;&lt;li&gt; (Feyz) You have a deck of n cards numbered from 1 to n. dealt and shuffled randomly. What is the probability that none of the i-th card is on the i-th position?&lt;br /&gt;&lt;/li&gt;&lt;li&gt; (Sonny) Prove that there is a natural number n, for which 2^n starts with the numbers 3141592, i.e., show that there is a number of the form 3141592....  which is a power of 2 (in base 10 representation).&lt;br /&gt;&lt;/li&gt;&lt;li&gt; (Alkan) Let n, k be integers greater than 1.&lt;br /&gt;&lt;ol&gt; &lt;li&gt; Show that 1/1 + 1/2 + 1/3 +...+ 1/n cannot be an integer.&lt;br /&gt;&lt;/li&gt;&lt;li&gt; Show that 1/k + 1/(k+1) + ... + 1/(k+n) cannot be an integer.&lt;br /&gt;&lt;/li&gt;&lt;/ol&gt;&lt;/li&gt;&lt;li&gt; (Will,Minsky) These problems have something in common: &lt;br /&gt;&lt;ol&gt; &lt;li&gt; A monk leaves to ascend to the temple on top of a mountain at 9am and arrives at 5pm. The next day he leaves the temple at 9am and arrives back at the foot of the mountain at 5pm. Prove that there is a point in time where he was at the same location on the path at the same time.&lt;br /&gt;&lt;/li&gt;&lt;li&gt; Prove that on a 2D earth, there exists a diameter such that the temperature at the endpoints is equal.&lt;br /&gt;&lt;/li&gt;&lt;li&gt; Prove that on a 3D earth, there exists a diameter such that the temperature and humidity of the endpoints are equal.&lt;br /&gt;&lt;/li&gt;&lt;li&gt; Does every convex closed curve in the plane contain all four vertices of some square?&lt;br /&gt;&lt;/li&gt;&lt;/ol&gt;&lt;/li&gt;&lt;li&gt; (Ben) Two nice algorithm questions: Given a shuffled array of numbers from 1 to 10,000, find the three that are missing in one pass.  Given an array of positive and negative integers, find the subarray with the highest sum in linear time. &lt;br /&gt;&lt;/li&gt;&lt;li&gt; (Winston) Four people want to pass a bridge dark at night.  They can walk the bridge in 1, 2, 9, and 10 minutes respectively.  The bridge can carry at most two people at a time.  There is a single flash-light, and they need the flash-light to walk on the bridge.  What is the shortest time for all four to pass across?  (This was apparently a popular Microsoft interview question). &lt;br /&gt;&lt;/li&gt;&lt;li&gt; (Beril) You have a glass of tea and a glass of milk.  You take a spoonfull of milk, mix it with the tea.  Then you take a spoonfull of this mixture and mix it with the milk.  Is there more milk in the tea or more tea in the milk at the end? &lt;br /&gt;&lt;/li&gt;&lt;li&gt; (Mine) Construct a square from: (a) Three identical squares, (b) Any two squares, (c) A rectangle.  (d) Divide a square into any given two squares with the same total area.  (e) Divide a circle into 6, 7, 8, and 10 equal pie-slices. &lt;br /&gt;&lt;/li&gt;&lt;li&gt; (IMO practice) m+n people are standing on a movie line.  m people have 5 dollar bills, n people have 10 dollar bills.  The movie is 5 dollars.  The cashier opens with no money.  It will close if it does not have enough change to give one person.  How many possible lines are there that will get through without closing the cashier? &lt;br/&gt;&lt;b&gt;Note: &lt;/b&gt;(Deniz, Mar 10, 1998) I just discovered that this problem is equivalent to finding the number of full binary trees with m leaves when n=m-1.  A full binary tree is a tree where each node has 0 or 2 children.  The number of binary trees is equivalent to the number of shift-reduce sequences that parse them.  For such a sequence to be valid the number of shifts need to always be ahead of the number of reduces, which turns this into our movie problem.  The binary tree problem can also be solved using a generating function and the relation b[n] = sum[k=1..n-1](b[k] b[n-k]).  The movie problem can be solved by using random walks and the reflection principle.  The two solutions seem to give different answers but they turn out to be equivalent.  This constitutes an indirect proof of the following combinatorial identity: (2n-1)!! = 2n!/(n! 2^n).  Everything is related to everything else in math :-)&lt;br /&gt;&lt;/li&gt;&lt;li&gt; (Oguz) Find a function f on real numbers such that f(f(x)) = -x. &lt;br /&gt;&lt;/li&gt;&lt;li&gt; (Boris) You meet many women in your life.  After meeting each one, you decide how good she is and whether you want to marry her.  If you decide to marry her, you lose your chance with future candidates.  If you decide to move on, you lose your chance with her.  Assuming you will meet at most n women, find the optimum strategy for marrying the best bride.  &lt;br&gt;&lt;font size=-1&gt; (The Azeri mathematician Gussein-Zade is apparently the first one to solve this problem.)  &lt;/font&gt; &lt;br /&gt;&lt;/li&gt;&lt;li&gt; (Alkan) A small rectangle is cut out of a large rectangle.  Find a line that divides the remaining figure into two equal areas using an unmarked ruler. &lt;br /&gt;&lt;/li&gt;&lt;li&gt; (IMO) Let A be a set of ten two-digit integers.  Prove that one can always find two subsets of A with the same sum. &lt;br /&gt;&lt;/li&gt;&lt;li&gt; (IMO) 17 people correspond with each other.  Each pair discusses one of three possible topics.  Prove that there are three people that discuss the same topic with each other. &lt;br /&gt;&lt;/li&gt;&lt;li&gt; (Alkan) Five couples meet in a party.  Everyone starts shaking hands with everyone else except their partners.  At some point the host stops them and asks how many handshakes each had.  Everyone gives a different number.  How many hands did the host's wife shake? &lt;br /&gt;&lt;/li&gt;&lt;li&gt; (IMO-75/4) When 4444 &lt;sup&gt; 4444 &lt;/sup&gt; is written in decimal notation, the sum of its digits is A.  Let B be the sum of the digits of A.  Find the sum of the digits of B.  (A and B are written in decimal notation.) &lt;br /&gt;&lt;/li&gt;&lt;li&gt; (Murat Fadiloglu) What is the probability of two randomly selected integers being mutually prime? &lt;br /&gt;&lt;/li&gt;&lt;li&gt; (Alkan) An old lady buys a plane ticket to visit her son.  She goes to the airport and people let her board first.  Since she can't read her seat number, she sits on a random seat.  Rest of the passengers sit on their own seats, unless it is occupied in which case they randomly choose one of the emtpy seats.  What is the probability that the last passenger will sit on his own seat? &lt;br /&gt;&lt;/li&gt;&lt;li&gt; (Alkan) sqrt(1 + 2*sqrt(1 + 3*sqrt(1 + 4*sqrt(1 + 5*sqrt(1 + ... ))...) = ?  &lt;br /&gt;&lt;/li&gt;&lt;li&gt; (Rota) Given a sequence of (n &lt;sup&gt; 2 &lt;/sup&gt; + 1) distinct integers, show that it is possible to find a sequence of (n+1) entries which is increasing or decreasing. &lt;br /&gt;&lt;/li&gt;&lt;li&gt; (Minkowsky) Consider a two dimensional lattice with grid points unit distance apart.  Show that a convex shape that is symmetric around a grid point has to include at least three grid points if its area is 4. &lt;br /&gt;&lt;/li&gt;&lt;li&gt; (Science Museum, unsolved) Which rectangles can be divided into unequal squares? &lt;br /&gt;&lt;/li&gt;&lt;li&gt; (Alkan) Consider permutations of an array which contains n copies of each integer from 1 to n.  Two permutations are defined as orthogonal if their corresponding elements form distinct pairs.  What is the maximum number of permutations such that any two are orthogonal?  For example, here is a possible set of mutually orthogonal permutations for n=3: {111222333, 123123123, 123231312, 123312231}. &lt;br /&gt;&lt;/li&gt;&lt;li&gt; (Ian) My new favorite algorithm: Find an algorithm that discovers if a linked list has a cycle in it using bounded memory. &lt;br /&gt;&lt;/li&gt;&lt;li&gt; (Uttrash) You are in prison and they give you n red balls, n green balls and two boxes.  You are to place the balls in the two boxes in any way you like.  The next day they will pick a ball from one of the boxes, and if it is green you will be set free.  How do you arrange the balls? &lt;br /&gt;&lt;/li&gt;&lt;li&gt; (Will) You randomly throw k balls into n bins.  What is the expected number of occupied bins. &lt;br /&gt;&lt;/li&gt;&lt;li&gt; (Will) You randomly throw k points on the unit interval.  What is the expected length of the longest segment. &lt;br /&gt;&lt;/li&gt;&lt;li&gt; (Michael, unsolved) You distribute 100 balls to 10 buckets.  What is the expected value of the number of balls in the bucket with most balls. &lt;br /&gt;&lt;/li&gt;&lt;li&gt; (Lon) Draw 2 circles, 1 completely inside the other (but not necessarily concentric.) What is the probablility that a line intersecting the outer circle also intersects the inner circle. Now, do the same with rectangles. &lt;br /&gt;&lt;/li&gt;&lt;li&gt; (Thurston and Conway) An angel is stuck on an infinite sheet of graph paper, he can hop from square to adjacent square.  Everytime the angel hops, the devil can knock out any square, so the angel can't ever go there.  Can the devil trap the angel?  What if the graph paper is a positive quadrant (i.e. is bounded on two sides). &lt;br /&gt;&lt;/li&gt;&lt;li&gt; This is not really math, but here are my two favorite algorithms: (1) Find an algorithm for perfect shuffling of an array.  (2) Find an algorithm that will pick a perfectly random element from a list in one pass without knowing the size of the list beforehand. &lt;br /&gt;&lt;/li&gt;&lt;li&gt; (Alkan) Given two points find the point midway between them using only a compass (no ruler). &lt;br /&gt;&lt;/li&gt;&lt;li&gt; (Alkan) You are sitting at point [0,0] and looking towards right into a tunnel bounded by y=1/x and y=-1/x curves.  The walls of the tunnel are reflecting.  Prove that if you send a light beam into the tunnel in any direction other than straight to the right, the beam will be reflected back towards left. &lt;br /&gt;&lt;/li&gt;&lt;li&gt; (Deniz) Let x be a random variable which can take positive integer values.  P(x)=1/2&lt;sup&gt;x&lt;/sup&gt;.  We draw n random elements from this distribution.  What is the probability that the n+1st element will be different from the first n? &lt;br /&gt;&lt;/li&gt;&lt;li&gt; (Alkan) Let A be the set of all rectangles that have one integer side.  Prove that any rectangle constructed by concatenating rectangles from A will also be a member of A. &lt;br /&gt;&lt;/li&gt;&lt;li&gt; (Neal) Take a randomly shuffled deck.  Open the cards one by one.  At one point stop and predict that the next card is red.  Is there a strategy that has more than 1/2 chance. &lt;br /&gt;&lt;/li&gt;&lt;li&gt; Pick two random points in the unit line segment.  What is the expected distance between them? &lt;br /&gt;&lt;/li&gt;&lt;li&gt; Pick two random points in the unit circle.  What is the expected distance between them? &lt;br /&gt;&lt;/li&gt;&lt;li&gt; (Umit) Suspend a rope from two points on the ceiling.  What shape does it take? &lt;br /&gt;&lt;/li&gt;&lt;li&gt; (Bernoulli brothers) A ball is rolling from point A to a lower point B.  What is the ideal curve for the path between A and B that minimize the travel time? &lt;br /&gt;&lt;/li&gt;&lt;li&gt; (Alkan) There are 100 light poles with switches on a street.  In the beginning all lights are off.  One person goes through pole number 1, 2, 3, ... and flips the switches.  Then he goes back and goes through 2, 4, 6, ... and flips the switches.  Then he goes back and goes through 3, 6, 9, ... and flips the switches.  So at n'th round he flips the multiples of n.  Which lights are on after 100 rounds? &lt;br /&gt;&lt;/li&gt;&lt;li&gt; (Deniz) There is a set A of n0 elements, and we randomly pick a subset B of n1 elements.  We know that r0 of the elements in A were red.  We are interested in finding out the number of red elements in B, r1.  To find out we start picking random elements from B.  We pick n2 elements, and r2 of them turn out to be red.  Now what is the best estimate for r1? &lt;br /&gt;&lt;/li&gt;&lt;li&gt; (Minsky) I bring you three flipped cups and tell you there is gold under one of them.  Furthermore, each cup has a number giving the probability that the gold is under that one.  You immediately go to the one with highest probability.  I tell you that you have amnesia and I may have tried this on you a million times.  What is your best strategy?  &lt;br /&gt;&lt;/li&gt;&lt;li&gt; (Minsky) An ant leaves a repeated binary pattern behind as it walks through the desert.  What is the length of the shortest pattern that would let you distinguish which way the ant was going? &lt;br /&gt;&lt;/li&gt;&lt;li&gt; (Feyzu) Two points are chosen at random on a line AB, each point being chosen according to the uniform distribution on AB, and the choices being made independently of each other. The line AB may now be regarded as divided into three parts. What is the probability that they may be made into a triangle? &lt;br /&gt;&lt;/li&gt;&lt;li&gt; (IMO practice) The entries for a competition is locked in a safe.  There are 11 judges.  We would like them to be able to open the safe when more than half get together.  How many locks / keys do we need? &lt;br /&gt;&lt;/li&gt;&lt;li&gt; (IMO practice) Given three parallel lines, show that an equilateral triangle can always be constructed with a vertex on each line. &lt;br /&gt;&lt;/li&gt;&lt;li&gt; (IMO-72/6) Given four distinct parallel planes, prove that there exists a regular tetrahedron with a vertex on each plane. &lt;br /&gt;&lt;/li&gt;&lt;li&gt; (Umit) A method for two people to share a pie fairly is to let one cut the other one pick.  Generalize this method to n people. &lt;br /&gt;&lt;/li&gt;&lt;li&gt; You are making a random walk on an n-dimensional grid.  What is the probability that you will ever return to the origin?  (Hint: It is 1 for 1-D and 2-D!  It is 0.3405 for 3-D). &lt;br /&gt;&lt;/li&gt;&lt;li&gt; (Ivanie) A rabbit hopping up the stairs can hop either one or two steps at a time.  How many different ways can it climb n steps? &lt;br /&gt;&lt;/li&gt;&lt;li&gt; Show that if you cut off two opposite corner squares of a chess board, you cannot cover the rest with dominoes. &lt;br /&gt;&lt;/li&gt;&lt;li&gt; Show that n squares with total area less than 1/2 can always be fit into a unit square (non-overlapping). &lt;br /&gt;&lt;/li&gt;&lt;li&gt; Show that n squares with total area greater than 3 can always cover the surface of the unit square (non-overlapping). &lt;br /&gt;&lt;/li&gt;&lt;li&gt; You color all points of a plain with three colors.  Show that I can always find two points of the same color that are a given distance apart. &lt;br /&gt;&lt;/li&gt;&lt;li&gt; You color all points on an equilateral triangle with two colors.  I try to find a right triangle with its vertices on the edges of your triangle and all vertices having the same color.  Can you find a coloring that prevents this? &lt;br /&gt;&lt;/li&gt;&lt;li&gt; How many 1's are there in the digits of numbers from 1 to 1 million?  (one minute time limit). &lt;br /&gt;&lt;/li&gt;&lt;li&gt; (Michael) There is a piece of candy on every node of a binary tree.  Find the shortest path through the binary tree that lets you collect all of the candies. &lt;br /&gt;&lt;/li&gt;&lt;li&gt; (Ihsan) Two men, x distance apart, start walking toward each other with speed v.  At that instant a fly starts flying from one men's nose to the other with 2v speed.  The fly keeps flying back and forth between the two noses until the guys meet.  How much distance has the fly flown when they meet?  (There is an easy way and a hard way to solve this). &lt;br /&gt;&lt;/li&gt;&lt;li&gt; (Ihsan) A coin is flipped until the first head appears.  If you get a head in n flips you win $2&lt;sup&gt;n&lt;/sup&gt;.  How much are you willing to pay to play this game? &lt;br /&gt;&lt;/li&gt;&lt;li&gt; (Bilim ve Teknik) You need to paint the area under the curve 1/x.  How can you do it with a finite amount of paint? &lt;br /&gt;&lt;/li&gt;&lt;/ul&gt;&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-111964927104730893?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/111964927104730893/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=111964927104730893' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/111964927104730893'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/111964927104730893'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2005/06/deniz-yurets-math-problems.html' title='Deniz Yuret&apos;s Math Problems'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-5085935951356909337</id><published>2009-04-04T19:11:00.014+03:00</published><updated>2010-12-29T13:07:35.633+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Books'/><title type='text'>Dennett in Istanbul</title><content type='html'>As part of &lt;a href="http://www.sabanciuniv.edu/eng/anasayfa/anasayfa.php"&gt;Sabanci University&lt;/a&gt;'s &lt;a href="http://www.sabanciuniv.edu/HaberlerDuyurular/Documents/DD20090326171642/Darwin_E1.pdf"&gt;Darwin Year Celebration&lt;/a&gt; activities, &lt;a href="http://ase.tufts.edu/cogstud/incbios/dennettd/dennettd.htm"&gt;Prof. Daniel C. Dennett&lt;/a&gt; is going to give a talk entitled "Darwin's strange inversion of reasoning" at the &lt;a href="http://muze.sabanciuniv.edu/main/default.php"&gt;Sakip Sabanci Museum&lt;/a&gt; on April 10, 2009 at 16:00.&lt;span class="fullpost"&gt;&lt;br /&gt;&lt;br /&gt;Dennett's 1995 book &lt;a href="http://www.amazon.com/Darwins-Dangerous-Idea-Evolution-Meanings/dp/068482471X"&gt;Darwin's Dangerous Idea&lt;/a&gt; argues that natural selection is a blind and algorithmic process sufficiently powerful to account for the generation and evolution of life, minds, and societies.  I am looking forward to his talk, an earlier version of which I had seen at MIT when the book first came out.  &lt;br /&gt;&lt;br /&gt;These days he has taken on religious fundamentalism (see &lt;a href="http://www.amazon.com/Breaking-Spell-Religion-Natural-Phenomenon/dp/0143038338"&gt;Breaking the Spell&lt;/a&gt;).  One of his proposed solutions to fight ignorance and intolerance is to teach children about ALL of world's religions instead of brainwashing them with a single system of thought, or leaving them vulnerable by not teaching them about religion at all.&lt;br /&gt;&lt;br /&gt;If you have not had the pleasure of listening to Dennett before, I recommend his many recorded talks available at the following websites: &lt;a href="http://www.ted.com/index.php/speakers/dan_dennett.html"&gt;TED talks&lt;/a&gt;, &lt;a href="http://en.wikipedia.org/wiki/Daniel_Dennett"&gt;Wikipedia&lt;/a&gt;, &lt;a href="http://www.reitstoen.com/dennett.php"&gt;Reitstoen.com&lt;/a&gt;, and &lt;a href="http://ase.tufts.edu/cogstud/incbios/dennettd/dennettd.htm"&gt;his homepage&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;Dennett is quite popular in the Artificial Intelligence / Cognitive Science community due to his refreshingly rational explanations of perplexing issues like consciousness and free will.  You may not agree with the specifics of his theories, but at least he makes a convincing case that there is no need for "magic dust" to explain these natural phenomena.  Talking about interesting psychological results in &lt;a href="http://www.amazon.com/Sweet-Dreams-Philosophical-Obstacles-Consciousness/dp/0262541912"&gt;Sweet Dreams&lt;/a&gt; he says:&lt;br /&gt;&lt;br /&gt;&lt;i&gt;I often discover skeptics who are quite confident that I am simply making these facts up!  But we must learn to treat such difficulties as measures of our frail powers of imagination, not insights into impossibility.&lt;/i&gt;&lt;br /&gt;&lt;br /&gt;Yet some of his adversaries take the failure of their imagination for a physical model of the mind as evidence of its impossibility, succumb to mysterianism, take comfort in the assumption that some questions will never be answered, or look for magic dust in the depths of quantum theory.  Dennett thinks one day we will find a psychological explanation for their defect.&lt;br /&gt;&lt;br /&gt;Many of Dennett's books &lt;a href="http://denizyuret.blogspot.com/search?q=dennett"&gt;that I have mentioned in this blog&lt;/a&gt; are on the mysteries of the mind.  If you get sleepy when you read philosophy, then I especially recommend &lt;a href="http://www.amazon.com/Minds-I-Fantasies-Reflections-Self/dp/0465030912"&gt;Mind's I&lt;/a&gt; which is one of my favorite collections of philosophical fiction.  Here is a list of his books which I hope to turn into an annotated bibliography at some point:&lt;br /&gt;&lt;br /&gt;&lt;a href="http://www.amazon.com/Content-and-Consciousness-ebook/dp/B000FA5WHG"&gt;Content and Consciousness&lt;/a&gt; (1969)&lt;br /&gt;&lt;a href="http://www.amazon.com/Brainstorms-Philosophical-Essays-Mind-Psychology/dp/0262540371"&gt;Brainstorms: Philosophical Essays on Mind and Psychology&lt;/a&gt; (1978)&lt;br /&gt;&lt;a href="http://www.amazon.com/Minds-I-Fantasies-Reflections-Self/dp/0465030912"&gt;Mind's I: Fantasies and Reflections on Self and Soul&lt;/a&gt; (1981)&lt;br /&gt;&lt;a href="http://www.amazon.com/Elbow-Room-Varieties-Worth-Wanting/dp/0262540428"&gt;Elbow Room: The Varieties of Free Will Worth Wanting&lt;/a&gt; (1984)&lt;br /&gt;&lt;a href="http://www.amazon.com/Intentional-Stance-Bradford-Books/dp/0262540533"&gt;The Intentional Stance&lt;/a&gt; (1987)&lt;br /&gt;&lt;a href="http://www.amazon.com/Consciousness-Explained-Daniel-C-Dennett/dp/0316180661"&gt;Consciousness Explained&lt;/a&gt; (1991)&lt;br /&gt;&lt;a href="http://www.amazon.com/Darwins-Dangerous-Idea-Evolution-Meanings/dp/068482471X"&gt;Darwin's Dangerous Idea: Evolution and the Meanings of Life&lt;/a&gt; (1995)&lt;br /&gt;&lt;a href="http://www.amazon.com/Kinds-Minds-Understanding-Consciousness-Science/dp/0465073514"&gt;Kinds of Minds: Towards and Understanding of Consciousness&lt;/a&gt; (1996)&lt;br /&gt;&lt;a href="http://www.amazon.com/Brainchildren-Essays-Designing-Minds-Representation/dp/0262540908"&gt;Brainchildren: Essays on Designing Minds&lt;/a&gt; (1998)&lt;br /&gt;&lt;a href="http://www.amazon.com/Freedom-Evolves-Daniel-C-Dennett/dp/0142003840"&gt;Freedom Evolves&lt;/a&gt; (2003)&lt;br /&gt;&lt;a href="http://www.amazon.com/Sweet-Dreams-Philosophical-Obstacles-Consciousness/dp/0262541912"&gt;Sweet Dreams: Philosophical Obstacles to a Science of Consciousness&lt;/a&gt; (2005)&lt;br /&gt;&lt;a href="http://www.amazon.com/Breaking-Spell-Religion-Natural-Phenomenon/dp/0143038338"&gt;Breaking the Spell: Religion as a Natural Phenomenon&lt;/a&gt; (2006)&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-5085935951356909337?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='related' href='http://www.sabanciuniv.edu/HaberlerDuyurular/Documents/DD20090326171642/Darwin_E1.pdf' title='Dennett in Istanbul'/><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/5085935951356909337/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=5085935951356909337' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/5085935951356909337'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/5085935951356909337'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2009/04/dennett-in-istanbul.html' title='Dennett in Istanbul'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-5144577763641499308</id><published>2009-03-22T18:00:00.001+02:00</published><updated>2010-11-03T09:08:54.398+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Türkçe'/><title type='text'>Matematik nereden gelir</title><content type='html'>Uzun zamandır rafımda toz tutan Lakoff ve Núñez'in &lt;a href="http://www.amazon.com/Where-Mathematics-Comes-Embodied-Brings/dp/0465037704"&gt;Where Mathematics Comes From&lt;/a&gt; (Matematik nereden gelir) kitabını yeni okumaya başlamıştım.  Geçen Perşembe de (serendipity'nin Türkçesi nedir?)  Alexandre Borovik'in Bilgi Üniversitesinde aynı konuda verdiği &lt;a href="http://ileriseviye.org/blog/?p=1765"&gt;Shadows of the Truth&lt;/a&gt; seminerine gitme şansını buldum, Emre sağolsun.  İnsan aklının matematiği nasıl ortaya çıkardığı sorusunu soran bu iki çalışma benzer bir sonuca varıyorlar: en soyut düşüncelerimiz bile metaforlarla somut, fiziksel, görsel, bedensel bilgilere bağlanıyor.  Sanırım dört işlem için parmaklarını kullanan, ya da bir üçgenin açıortaylarının aynı noktada kesiştiğini görebilmek için eline kağıt kalem alma ihtiyacı hisseden hiç kimse için bu sürpriz olmamalı.  Şaşırtıcı olan matematik eğitiminde bu bağlantıların daha yapıcı şekilde kullanılmaması. &lt;span class="fullpost"&gt;&lt;br /&gt;&lt;br /&gt;Özellikle çocuklarda henüz soyutla somutun arası çok açılmadığı için bu bağlantılar son derece açık.  Örneğin Lakoff ile Núñez temel aritmetiğin dört zihinsel işleve dayandığını iddia ediyorlar: farklı cisimleri bir araya getirmek, parçalarından bir cisim oluşturmak, cetvelle bir uzunluk ölçmek, ve bir yol üzerinde hareket etmek.  Borovik'in dört işlem örneklerinde çocukların sayı kavramını henüz saydıkları cisimlerden soyutlayamadıkları için aslında aritmetikten çok daha zor bir problemle uğraştıkları vurgulanıyor.  Örneğin elmalarla armutları toplayamayız derken, 10 elmayı 5 insana bölmek kurallara aykırı sayılmıyor okul kitaplarında.  Hele hele 10 elmayı ikişer ikişer paylaştırdığımızda beş insana yeteceğini hesaplarken (10/2=5), sol tarafta elmaları bölerken sağ taraftan insanların çıkmasına ne demeli?  Benzer örnekleri ve çocukların bunları nasıl kavramlaştırdığı konusundaki teorilerini Borovik'in &lt;a href="http://www.maths.manchester.ac.uk/~avb"&gt;web sayfasındaki&lt;/a&gt; iki kitap çalışmasında okuyabilirsiniz.&lt;br /&gt;&lt;br /&gt;Lakoff ve Núñez'in kitabının son bölümü tek bir örneğe ayrılmış.  Euler'in meşhur e^πi + 1 = 0 eşitliği.  Yazarlar haklı olarak matematiğin en meşhur sayılarını bir araya getiren bu eşitliğin 3^6=729 gibi rastgele bir rakamsal eşitlik olmadığını belirtiyorlar.  Euler'in eşitliğindeki her sabitin uzun bir metaforlar zinciriyle desteklenen derin birer anlamı var.  Bu zinciri bir noktada kaybeden öğrenci eşitliğin ispatını anlasa bile ne demek olduğunu anlamıyor.  Yazarların 70 sayfada sabırla çözümlediği bu metaforlar zincirini burada hakkıyla anlatmam zor.  Ama aşağıdaki sorulardan bazıları sizin de kafanızı karıştırıyorsa okumanızı tavsiye ederim:&lt;br /&gt;&lt;br /&gt;1. Eğer a^b işlemi a sayısını b defa kendisiyle çarpmak demek ise e sayısını kendisiyle pi defa çarpmak ne demek?  &lt;br /&gt;&lt;br /&gt;2. i sayısı nereden gelir, neden sanaldır, bir sayıyı i ile çarpmak ya da i'nci üssünü almak ne demek?&lt;br /&gt;&lt;br /&gt;3. e sayısı niye 2.718281828459045... değerine sahip?&lt;br /&gt;&lt;br /&gt;4. pi sayısı dairelerle ilgili birşey değil miydi?  Euler'in eşitliğinin dairelerle ne ilgisi var?&lt;br /&gt;&lt;br /&gt;5. İçinde e ve pi gibi iki tane sonsuz kesir olan bir işlem bize nasıl -1 gibi basit bir sonuç verebilir?&lt;br /&gt;&lt;br /&gt;6. Matematikte niye e, pi, i, 1 ve 0 sayıları durmadan ortaya çıkıyor ama çoğu sayı (örneğin 192563948.98542129) pek görünmüyor?  Bu sayıların sembolize ettiği fikir ve kavramlar nedir?&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-5144577763641499308?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='related' href='http://www.amazon.com/Where-Mathematics-Comes-Embodied-Brings/dp/0465037704' title='Matematik nereden gelir'/><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/5144577763641499308/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=5144577763641499308' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/5144577763641499308'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/5144577763641499308'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2009/03/matematik-nereden-gelir.html' title='Matematik nereden gelir'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-6126425705983015474</id><published>2009-03-03T11:21:00.005+02:00</published><updated>2011-09-22T19:05:34.741+03:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Publications'/><title type='text'>Classification of semantic relations between nominals</title><content type='html'>Roxana Girju, Preslav Nakov, Vivi Nastase, Stan Szpakowicz, Peter Turney and Deniz Yuret.  &lt;i&gt;Language Resources and Evaluation, 2009, 43(2), 105-121.&lt;/i&gt; (&lt;a href="https://docs.google.com/viewer?a=v&amp;pid=explorer&amp;chrome=true&amp;srcid=0B6C4-zOYlkxsZWIwYTZlNDYtN2Q3Yi00OTM1LTliM2MtMGU5ZGEyMzk0NGY3&amp;hl=en"&gt;PDF&lt;/a&gt;, &lt;a href="http://www.springerlink.com/content/cr044066132q4u15/fulltext.pdf"&gt;PDF&lt;/a&gt;, &lt;a href="http://www.springerlink.com/content/cr044066132q4u15/fulltext.html"&gt;HTML&lt;/a&gt;).&lt;span class="fullpost"&gt;&lt;br /&gt;&lt;br /&gt;Abstract: The NLP community has shown a renewed interest in deeper semantic analyses, among them automatic recognition of semantic relations in text. We present the development and evaluation of a semantic analysis task: automatic recognition of relations between pairs of nominals in a sentence. The task was part of SemEval-2007, the fourth edition of the semantic evaluation event previously known as SensEval. Apart from the observations we have made, the long-lasting effect of this task may be a framework for comparing approaches to the task. We introduce the problem of recognizing relations between nominals, and in particular the process of drafting and refining the definitions of the semantic relations. We show how we created the training and test data, list and briefly describe the 15 participating systems, discuss the results, and conclude with the lessons learned in the course of this exercise.&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-6126425705983015474?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='related' href='http://springerlink.com/content/cr044066132q4u15' title='Classification of semantic relations between nominals'/><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/6126425705983015474/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=6126425705983015474' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/6126425705983015474'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/6126425705983015474'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2009/03/classification-of-semantic-relations.html' title='Classification of semantic relations between nominals'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-7629547836618956312</id><published>2009-03-01T18:45:00.016+02:00</published><updated>2010-11-03T09:08:54.399+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Books'/><title type='text'>Incandescence by Greg Egan</title><content type='html'>Greg Egan's latest novel &lt;a href="http://www.amazon.com/Incandescence-Greg-Egan/dp/1597801283"&gt;Incandescence&lt;/a&gt; is a great thought experiment exploring how a primitive alien race living inside the tunnels of an asteroid orbiting around a neutron star can find evidence for Newtonian physics and even general relativity using simple experiments inside their closed world.  The motion around the star causes a nontrivial field of forces inside the asteroid.  There is no weight at the center.  Away from the center, the amount and direction of the weight depends on the location relative to the center.  Zak, the alien Newton, starts to explain this state of affairs with a simple assumption: "Weight is the difference between preferred and actual motion." &lt;span class="fullpost"&gt;&lt;br /&gt;&lt;br /&gt;For example, when an object is placed north or south of the center, it is pulled towards the center.  Here is what an object left to free fall from the north of the center looks like to an observer inside the asteroid:&lt;br /&gt;&lt;div style="width:400px; overflow:hidden;"&gt;&lt;br /&gt;&lt;img src="http://gregegan.customer.netspace.net.au/INCANDESCENCE/Orbits/orbit2.gif" style="margin-left:-400px;  padding:0px; background-image:url(http://gregegan.customer.netspace.net.au/INCANDESCENCE/Orbits/orbit2key.gif); "/&gt;&lt;br /&gt;&lt;/div&gt;&lt;br /&gt;&lt;br /&gt;However from a different perspective, one can see that the object is actually moving around its own slightly tilted orbit around the star.  And if the object is supported by a floor and is prevented from moving toward the center, it will experience a gravitational pull toward the center -- thus an example of the maxim: "Weight is the difference between preferred and actual motion." &lt;br /&gt;&lt;div style="width:400px; overflow:hidden;"&gt;&lt;br /&gt;&lt;img src="http://gregegan.customer.netspace.net.au/INCANDESCENCE/Orbits/orbit2.gif" style="padding:0px; background-image:url(http://gregegan.customer.netspace.net.au/INCANDESCENCE/Orbits/orbit2key.gif); "/&gt;&lt;br /&gt;&lt;/div&gt;&lt;br /&gt;&lt;br /&gt;When two object are placed toward and away from the sun with respect to the center of the asteroid, they feel a pull away from the center:&lt;br /&gt;&lt;div style="width:400px; overflow:hidden;"&gt;&lt;br /&gt;&lt;img src="http://gregegan.customer.netspace.net.au/INCANDESCENCE/Orbits/orbit3.gif" style="margin-left:-400px;  padding:0px; background-image:url(http://gregegan.customer.netspace.net.au/INCANDESCENCE/Orbits/orbit3key.gif); "/&gt;&lt;br /&gt;&lt;/div&gt;&lt;br /&gt;&lt;br /&gt;Again, from a different perspective, this can be seen as the objects trying to follow their own natural orbits, and weight being the difference between preferred and actual motion:&lt;br /&gt;&lt;div style="width:400px; overflow:hidden;"&gt;&lt;br /&gt;&lt;img src="http://gregegan.customer.netspace.net.au/INCANDESCENCE/Orbits/orbit3.gif" style="padding:0px; background-image:url(http://gregegan.customer.netspace.net.au/INCANDESCENCE/Orbits/orbit3key.gif); "/&gt;&lt;br /&gt;&lt;/div&gt;&lt;br /&gt;&lt;br /&gt;As a final example, suppose an object at the center is given a radial push.  Unlike the objects in the previous example, this object will not accelerate away from the center, but keep cycling in an elliptical course as seen from inside the asteroid:&lt;br /&gt;&lt;div style="width:400px; overflow:hidden;"&gt;&lt;br /&gt;&lt;img src="http://gregegan.customer.netspace.net.au/INCANDESCENCE/Orbits/orbit4.gif" style="margin-left:-400px; padding:0px; background-image:url(http://gregegan.customer.netspace.net.au/INCANDESCENCE/Orbits/orbit4key.gif); "/&gt;&lt;br /&gt;&lt;/div&gt;&lt;br /&gt;&lt;br /&gt;The standard explanation is in terms of Coriolis force due to the object's motion balancing the tidal acceleration.  From the orbital perspective, our radial push places the object in an elliptical orbit crossing the circular orbit of the planet:&lt;br /&gt;&lt;div style="width:400px; overflow:hidden;"&gt;&lt;br /&gt;&lt;img src="http://gregegan.customer.netspace.net.au/INCANDESCENCE/Orbits/orbit4.gif" style="padding:0px; background-image:url(http://gregegan.customer.netspace.net.au/INCANDESCENCE/Orbits/orbit4key.gif); "/&gt;&lt;br /&gt;&lt;/div&gt;&lt;br /&gt;&lt;br /&gt;Starting with these and many other interesting observations, Zak and his friends uncover the secrets of their universe before ever stepping outside their closed world and ever seeing their sun, their orbit, or fixed stars.  The book is a treat for aficionados of gedankenexperiments and qualitative physics.&lt;br /&gt;&lt;br /&gt;P.S. I'd like to thank Greg Egan for allowing the use of these animations from &lt;a href="http://www.gregegan.net"&gt;his page&lt;/a&gt;, which hosts a treasure of information about science and science fiction.&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-7629547836618956312?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='related' href='http://www.amazon.com/Incandescence-Greg-Egan/dp/1597801283' title='Incandescence by Greg Egan'/><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/7629547836618956312/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=7629547836618956312' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/7629547836618956312'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/7629547836618956312'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2009/02/incandescence-by-greg-egan.html' title='Incandescence by Greg Egan'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-3926988295343857786</id><published>2009-02-16T16:50:00.006+02:00</published><updated>2010-11-03T09:08:54.400+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Notes'/><category scheme='http://www.blogger.com/atom/ns#' term='Downloads'/><title type='text'>Ergun's English-Turkish machine translation notes</title><content type='html'>Here are some useful notes from Ergun Bicici on getting started with Turkish-English machine translation, followed by some suggestions by Murat Alperen on collecting Turkish-English parallel text data. &lt;span class="fullpost"&gt;&lt;br /&gt;&lt;br /&gt;Turkish English parallel text from Kemal Oflazer, Statistical Machine Translation into a Morphologically Complex Language, Invited Paper, In Proceedings of &lt;a href="http://www.gelbukh.com/cicling/2008/"&gt;CICLING 2008&lt;/a&gt; -- Conference on Intelligent Text Processing and Computational Linguistics, Haifa, Israel, February 2008 (lowercased and converted to utf8):&lt;br /&gt;&lt;a href="http://deniz.yuret.com/turkish/en-tr.zip"&gt; en-tr.zip&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;The Turkish part of the dataset is "selectively split", i.e. some suffixes are separated from their stems, some are not.&lt;br /&gt;&lt;br /&gt;Here is the Turkish text to develop the language model:&lt;br /&gt;&lt;a href="http://deniz.yuret.com/turkish/lm.tr.gz"&gt; lm.tr.gz&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;The directions for the Moses baseline system:&lt;br /&gt;&lt;a href="http://www.statmt.org/wmt09/baseline.html"&gt; http://www.statmt.org/wmt09/baseline.html&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;The link for the scripts:&lt;br /&gt;&lt;a href="http://www.statmt.org/wmt08/scripts.tgz"&gt; http://www.statmt.org/wmt08/scripts.tgz&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;Be careful to put the stems and suffixes back together before computing the BLEU score.  Splitting them artificially increases the score.&lt;br /&gt;&lt;br /&gt;To compute the score do not use the mteval scorer at http://www.statmt.org/wmt09/baseline.html - because it retokenizes the input and splits all the '+' characters that are used to denote suffixes.  Either use the multi-bleu perl script, or comment out the  language-dependent part of NormalizeText in mteval.&lt;br /&gt;&lt;br /&gt;For Turkish dictionaries and other resources please see &lt;a href="http://denizyuret.blogspot.com/2006/11/turkish-resources.html"&gt;Turkish language resources&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-3926988295343857786?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/3926988295343857786/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=3926988295343857786' title='10 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/3926988295343857786'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/3926988295343857786'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2009/02/erguns-english-turkish-machine.html' title='Ergun&apos;s English-Turkish machine translation notes'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>10</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-9005091566762497993</id><published>2009-02-11T18:46:00.010+02:00</published><updated>2010-11-03T09:08:54.401+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Notes'/><title type='text'>Turkish morphology presentation</title><content type='html'>Please click on "Full Post" to view an introductory Turkish morphology presentation.  To learn more about Turkish morphology I recommend &lt;a href="http://www.denizyuret.com/ref/oflazer/BU-CEIS-9304.pdf"&gt; this paper&lt;/a&gt; by Kemal Oflazer.  For morphological disambiguation check out &lt;a href="http://www.cmpe.boun.edu.tr/~hasim"&gt; Haşim Sak's webpage &lt;/a&gt; or &lt;a href="http://denizyuret.blogspot.com/2006/06/learning-morphological-disambiguation.html"&gt; this paper&lt;/a&gt; by Deniz Yuret and Ferhan Türe.  You can try analyzing a Turkish word here: &lt;form method="post" action="http://www.denizyuret.com/cgi-bin/mor.cgi"&gt; &lt;input type="text" name="words" /&gt; &lt;/form&gt;&lt;span class="fullpost"&gt;&lt;br /&gt;&lt;br /&gt;&lt;iframe src='http://docs.google.com/EmbedSlideshow?docid=d2jm3f3_189zszc8bct' frameborder='0' width='410' height='342'&gt;&lt;/iframe&gt;&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-9005091566762497993?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='related' href='http://docs.google.com/Presentation?id=d2jm3f3_189zszc8bct' title='Turkish morphology presentation'/><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/9005091566762497993/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=9005091566762497993' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/9005091566762497993'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/9005091566762497993'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2009/02/turkish-morphology-presentation.html' title='Turkish morphology presentation'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-3274500759910285955</id><published>2009-02-02T21:41:00.004+02:00</published><updated>2010-11-03T09:08:54.401+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Türkçe'/><title type='text'>İngilizce Türkçe otomatik tercüme</title><content type='html'>Google sonunda otomatik tercüme yaptığı dillere Türkçe'yi de&lt;br /&gt;ekledi.  Denemek isterseniz: &lt;a href="http://translate.google.com/"&gt;http://translate.google.com&lt;/a&gt;&lt;span class="fullpost"&gt;&lt;br /&gt;&lt;br /&gt;Bu teknolojinin İngilizce bilmeyen Türk nüfusunun internetteki&lt;br /&gt;bilgi birikimine ulaşımı için önemli olduğunu düşünüyor ve birkaç&lt;br /&gt;yıldır üzerinde ben de çalışıyorum.  En büyük engellerden biri&lt;br /&gt;araştırma amacıyla kullanılabilecek yüklü miktarda&lt;br /&gt;İngilizce-Türkçe paralel metne ihtiyaç olması (yaklaşık 100 milyon&lt;br /&gt;kelime = 1000 kitap).  Bu metni toplayabilmek için bir iki yıl&lt;br /&gt;telefonla devlet kurumları, uluslararası kuruluşlar, yayınevi,&lt;br /&gt;haber kurumu, hukuk ve tercümanlık şirketleri, üniversite&lt;br /&gt;bölümleri vs ile görüşüp pozitif bir cevap alamayınca yoruldum ve&lt;br /&gt;vazgeçtim.  İşin üzücü tarafı karşılaştığım büyük engelin yayın&lt;br /&gt;hakkı, fikir mülkiyeti gibi hukuksal bir konu değil, insanların&lt;br /&gt;ilgisizliği olması.  Şimdilik bir iki milyon kelimelik metinden&lt;br /&gt;geliştirilmiş oyuncak bir sistemle uğraşıyorum öğrencilerimle.&lt;br /&gt;Google'ın sistemini ben yazmış olmak isterdim.  Ama henüz yarışma&lt;br /&gt;bitmiş değil, sistemin kalitesine bir örnek olarak bu paragrafın&lt;br /&gt;bir tercümesini veriyorum...&lt;br /&gt;&lt;br /&gt;This technology does not speak English in the Turkish population&lt;br /&gt;on the Internet access to knowledge is important to think and a&lt;br /&gt;few years, I am working on. One of the biggest obstacles can be&lt;br /&gt;used for research purposes in the amount of installed&lt;br /&gt;English-English parallel text is needed (about 100 million word =&lt;br /&gt;1000 books). This text to be able to collect a two-year phone and&lt;br /&gt;the state institutions, international organizations, publishing,&lt;br /&gt;news agency, law and translation companies, universities sections&lt;br /&gt;and with a positive answer so do not get tired and I've given&lt;br /&gt;up. The major obstacle faced by the unfortunate job of&lt;br /&gt;broadcasting the The right to legal issues such as property, not&lt;br /&gt;the ideas, people is indifference. Currently, one of two million&lt;br /&gt;words of text I'm dealing with a system developed for students&lt;br /&gt;with toys.  I would like it to Google's system. But the contest&lt;br /&gt;yet not finished, as an example of the quality of the system of&lt;br /&gt;this paragraph I give a translation...&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-3274500759910285955?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='related' href='http://translate.google.com' title='İngilizce Türkçe otomatik tercüme'/><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/3274500759910285955/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=3274500759910285955' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/3274500759910285955'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/3274500759910285955'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2009/02/ingilizce-turkce-otomatik-tercume.html' title='İngilizce Türkçe otomatik tercüme'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-6539349836923660337</id><published>2009-01-28T12:12:00.004+02:00</published><updated>2010-11-03T08:14:40.093+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Students'/><title type='text'>Neşe Aral, M.S. 2009</title><content type='html'>&lt;iframe src='http://docs.google.com/EmbedSlideshow?docid=d2jm3f3_570nr4d2tg5' frameborder='0' width='410' height='342'&gt;&lt;/iframe&gt;&lt;span class="fullpost"&gt;&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Dynamics of Gene Regulatory Cell Cycle Network in Saccharomyces Cerevisiae&lt;/b&gt;&lt;br /&gt;Neşe Aral. M.S. Thesis, Koç University Department of Physics, January 2009.&lt;br /&gt;&lt;br /&gt;Abstract:  In this thesis, the genetic regulatory dynamics within the cell cycle network of the yeast Saccharomyces Cerevisiae is examined. As the mathematical approach, an asynchronously updated Boolean network is used to model the time evolution of the expression level of genes taking part in the regulation of the  cell-cycle. The attractors of the model’s dynamics and their stability are investigated by means of a stochastic transition matrix. It is shown that the cell cycle network has unusual dynamical properties when compared with similar random networks. Furthermore, an entropy measure is employed to monitor the sequential evolution of the system. It is observed that the experimentally identified cell cycle phases G1, S, G2 and M correspond to the stages of the network where the entropy goes through a local extremum.&lt;br /&gt;&lt;br /&gt;&lt;a href="http://deniz.yuret.com/students/naral/nese_tez.pdf"&gt;Download PDF&lt;/a&gt;&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-6539349836923660337?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='related' href='http://deniz.yuret.com/students/naral/nese_tez.pdf' title='Neşe Aral, M.S. 2009'/><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/6539349836923660337/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=6539349836923660337' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/6539349836923660337'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/6539349836923660337'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2009/01/nese-aral-ms-2009.html' title='Neşe Aral, M.S. 2009'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-2962070668525203261</id><published>2008-12-03T16:41:00.008+02:00</published><updated>2010-11-03T09:08:54.402+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Notes'/><title type='text'>Show and Tell</title><content type='html'>Here is a little demo video of some work I did with Sajit at MIT CSAIL this summer.  The computer watches us play with a ball and produces live commentary.  For now the detection of actions (like give, drop, move) are hand coded, the next step would be to learn them from examples.  The step after that is "tell and show", i.e. to go from words to pictures.  This would complete the imagination-perception loop which may underlie much understanding and problem solving.&lt;br /&gt;&lt;br /&gt;I think one of the coolest things about the current implementation is how the computer starts a sentence and cuts it in half to say something more important.  There is always tons of possible things to say and possible words to say them with, and a similar competition must be going on in our heads.&lt;br /&gt;&lt;br /&gt;&lt;object width="320" height="266" class="BLOG_video_class" id="BLOG_video-29e70b6d0b12e695" classid="clsid:D27CDB6E-AE6D-11cf-96B8-444553540000" codebase="http://download.macromedia.com/pub/shockwave/cabs/flash/swflash.cab#version=6,0,40,0"&gt;&lt;param name="movie" value="http://www.youtube.com/get_player"&gt;&lt;param name="bgcolor" value="#FFFFFF"&gt;&lt;param name="allowfullscreen" value="true"&gt;&lt;param name="flashvars" value="flvurl=http://v19.nonxt7.googlevideo.com/videoplayback?id%3D29e70b6d0b12e695%26itag%3D5%26app%3Dblogger%26ip%3D0.0.0.0%26ipbits%3D0%26expire%3D1330148624%26sparams%3Did,itag,ip,ipbits,expire%26signature%3D2F3AAA425840329B9645F0563802A1DB36188BC2.694AE4D088E47786C192C17EA32253D4FFBE9B70%26key%3Dck1&amp;amp;iurl=http://video.google.com/ThumbnailServer2?app%3Dblogger%26contentid%3D29e70b6d0b12e695%26offsetms%3D5000%26itag%3Dw160%26sigh%3DFbptm0co5B4b7XnAlSQtXMc1p8Q&amp;amp;autoplay=0&amp;amp;ps=blogger"&gt;&lt;embed src="http://www.youtube.com/get_player" type="application/x-shockwave-flash"width="320" height="266" bgcolor="#FFFFFF"flashvars="flvurl=http://v19.nonxt7.googlevideo.com/videoplayback?id%3D29e70b6d0b12e695%26itag%3D5%26app%3Dblogger%26ip%3D0.0.0.0%26ipbits%3D0%26expire%3D1330148624%26sparams%3Did,itag,ip,ipbits,expire%26signature%3D2F3AAA425840329B9645F0563802A1DB36188BC2.694AE4D088E47786C192C17EA32253D4FFBE9B70%26key%3Dck1&amp;iurl=http://video.google.com/ThumbnailServer2?app%3Dblogger%26contentid%3D29e70b6d0b12e695%26offsetms%3D5000%26itag%3Dw160%26sigh%3DFbptm0co5B4b7XnAlSQtXMc1p8Q&amp;autoplay=0&amp;ps=blogger"allowFullScreen="true" /&gt;&lt;/object&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-2962070668525203261?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='enclosure' type='video/mp4' href='http://www.blogger.com/video-play.mp4?contentId=29e70b6d0b12e695&amp;type=video%2Fmp4' length='0'/><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/2962070668525203261/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=2962070668525203261' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/2962070668525203261'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/2962070668525203261'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2008/12/show-and-tell.html' title='Show and Tell'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-996541619253152936</id><published>2008-10-31T16:03:00.007+02:00</published><updated>2009-03-27T03:32:51.221+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Publications'/><title type='text'>Morphological cues vs. number of nominals in learning verb types in Turkish: Syntactic bootstrapping mechanism revisited</title><content type='html'>Deniz Yuret, A. Engin Ural, Nihan Ketrez, Dilara Koçbaş and Aylin C. Küntay. In &lt;i&gt;The Boston University Conference on Language Development (BUCLD)&lt;/i&gt; (&lt;a href="http://docs.google.com/Doc?id=d2jm3f3_3254fzphg6"&gt;Long abstract&lt;/a&gt;, &lt;a href="http://www.denizyuret.com/research/2008/bucld33/BUCLD2008NK.ppt"&gt; poster&lt;/a&gt;, &lt;a href="http://www.bu.edu/linguistics/APPLIED/BUCLD/supplement33/Ural.pdf"&gt; PDF&lt;/a&gt;)&lt;span class="fullpost"&gt;&lt;br /&gt;&lt;br /&gt;Abstract: The syntactic bootstrapping mechanism of verb classification was evaluated against child-directed speech in Turkish, a language with rich morphology, nominal ellipsis and free word order. Machine-learning algorithms were run on transcribed caregiver speech (12,276 and 20,687 utterances) directed to two Turkish learners (one hour every two weeks between 0,9 to 1;10) of different socioeconomic backgrounds. The corpora contained 12,276 and 20,687 child-directed utterances. Study 1 found that the number of nominals in child-directed utterances plays some role in classifying transitive and intransitive verbs. Study 2 found that accusative morphology on the noun is a stronger cue in clustering verb types. Study 3 found that verbal morphology is useful in distinguishing between different subtypes of intransitive verbs. These results suggest that syntactic bootstrapping mechanisms should be extended to include morphological cues to verb learning in morphologically rich languages.&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-996541619253152936?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='related' href='http://www.bu.edu/linguistics/APPLIED/BUCLD/supplement33/Ural.pdf' title='Morphological cues vs. number of nominals in learning verb types in Turkish: Syntactic bootstrapping mechanism revisited'/><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/996541619253152936/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=996541619253152936' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/996541619253152936'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/996541619253152936'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2008/10/morphological-cues-vs-number-of.html' title='Morphological cues vs. number of nominals in learning verb types in Turkish: Syntactic bootstrapping mechanism revisited'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-4741823383099032949</id><published>2008-09-20T22:55:00.004+03:00</published><updated>2010-11-03T09:08:54.403+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Notes'/><title type='text'>Parser Training and Evaluation using Textual Entailments</title><content type='html'>&lt;i&gt;A task proposal for SemEval-2010 by Deniz Yuret and Önder Eker.  For example sentences and the bibliography, please download the &lt;a href="http://www.denizyuret.com/research/2009/pete/proposal/pete.pdf"&gt;original PDF&lt;/a&gt;.&lt;/i&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight:bold;"&gt;Description of the Task&lt;br /&gt;&lt;/span&gt;&lt;br /&gt;We propose a targeted textual entailment task designed to train and evaluate parsers.  The typical parser training and evaluation methodology uses a gold treebank which raises several issues: (1) The treebank is built around a particular linguistic representation which makes parsers that use different representations (e.g. phrase structure vs. dependency) difficult to compare.  (2) The parsers are evaluated based on how much of the linguistic structure dictated by the treebank they can replicate, some of which may be irrelevant for downstream applications.  (3) The annotators that create the treebank not only have to understand the sentences in the corpus, but master the particular linguistic representation used, which makes their training difficult and leads to inconsistencies (see Carroll1998 for a review and Parseval2008 for more recent work on parser evaluation).&lt;br /&gt;&lt;br /&gt;In the proposed method, simple textual entailments like the following will be used to fine-tune and evaluate different parsers:&lt;span class="fullpost"&gt;&lt;br /&gt;&lt;ul&gt;&lt;li&gt;Final-hour trading accelerated to 108.1 million shares, a record for the Big Board.&lt;br /&gt;  &lt;ul&gt;&lt;li&gt;108.1 million shares was a record.  -- YES &lt;br /&gt;  &lt;/li&gt;&lt;li&gt;Final-hour trading accelerated a record.  -- NO&lt;br /&gt;  &lt;/li&gt;&lt;/ul&gt;&lt;br /&gt;&lt;/li&gt;&lt;li&gt;Earlier the company announced it would sell its aging fleet of Boeing Co. 707s because of increasing maintenance costs.  &lt;br /&gt;  &lt;ul&gt;&lt;li&gt;It would sell the fleet because of increasing costs.  -- YES &lt;br /&gt;  &lt;/li&gt;&lt;li&gt;Selling the fleet would increase maintenance costs.  -- NO &lt;br /&gt;  &lt;/li&gt;&lt;/ul&gt;&lt;br /&gt;&lt;/li&gt;&lt;li&gt;Persistent redemptions would force some fund managers to dump stocks to raise cash.&lt;br /&gt;  &lt;ul&gt;&lt;li&gt;The managers would dump stocks to raise cash.  -- YES&lt;br /&gt;  &lt;/li&gt;&lt;li&gt;The stocks would raise cash.  -- NO   &lt;/li&gt;&lt;/ul&gt;&lt;/li&gt;&lt;/ul&gt;&lt;br /&gt;The entailment examples will be generated based on the following criteria: (1) It should be possible to automatically decide which entailments are implied based on the parser output only, i.e. there should be no need for lexical semantics, anaphora resolution etc.  (2) It should be easy for a non-linguist annotator to decide which entailments are implied, reducing the time for training and increasing inter-annotator agreement.  (3) The entailments should be non-trivial, i.e. they should focus on areas of disagreement between current state of the art parsers.  The above examples satisfy all three criteria.&lt;br /&gt;&lt;br /&gt;Training and evaluating parsers based on targeted textual entailments address each of the issues listed in the first paragraph regarding treebank based methods: The evaluation is representation independent, therefore there is no difficulty in comparing the performance of parsers from different frameworks.  By focusing on the parse differences that result in different entailments, we ignore trivial differences that stem from the conventions of the underlying representation which should not matter for downstream applications.  Finally our annotators will only need a good understanding of the English language and no expertise on any linguistic framework.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight:bold;"&gt;Generating Data&lt;br /&gt;&lt;/span&gt;&lt;br /&gt;The example entailment questions can be generated by considering the differences between the outputs of different state of the art parsers and their gold datasets.  Some of the detected parse differences can be turned into different entailments about the sentence.  The example entailments in the previous section were generated comparing the outputs of two dependency parsers, which are included in the appendix.  The generated entailments will then be annotated by multiple annotators and the differences will be resolved using standard techniques.&lt;br /&gt;&lt;br /&gt;Generating entailment questions out of parser differences allow us to satisfy conditions 1 and 3 easily: the entailments can be judged based on parser output because that is how they were generated, and they are non-trivial because some state of the art parsers disagree on them.&lt;br /&gt;&lt;br /&gt;In our experience the most difficult condition to satisfy is 2: that it should be easy for a non-linguist annotator to decide which entailments are implied.  In most of the example sentences we looked at, the differences between the parsers were trivial, e.g. different conventions on how to tag coordinating conjunctions, or whether to label a particular phrase with ADVP vs. ADJP etc.  These differences are trivial in the sense that it is impossible to generate different entailments from them, thus it is hard to see how they would matter in a downstream application.&lt;br /&gt;&lt;br /&gt;The trivial differences between parsers make example generation using our process difficult.  The efficiency of example generation may be improved by pre-filtering candidate sentences which contain structures that involve non-trivial decisions by the parser such as prepositional phrase attachments.  In addition some types of entailment generation can be automated.  On the other hand the requirement of expressing differences in entailments will hopefully focus the training and the evaluation on non-trivial differences that actually matter in applications.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight:bold;"&gt;Evaluation Methodology&lt;br /&gt;&lt;/span&gt;&lt;br /&gt;The participants will be provided with training and test sets of entailments and they will be evaluated using the standard tools and methodology of the RTE challenges (Dagan2006).  The main difference is that our entailment examples focus exclusively on parsing.  This should make it possible to write simple tree matching modules that decide on the entailments based on parser output alone.  Example tree matching modules for standard formats (Penn Treebank (Marcus1993) format for phrase structure parsing, and CoNLL (Nivre2007) format for dependency parsing) will be provided as examples which should make preparing an existing parser for evaluation relatively easy.&lt;br /&gt;&lt;br /&gt;The training part will be more parser specific, so individual participants will have to decide how to best make use of the provided entailment training set.  It is unlikely that we will be able to generate enough entailment examples to train a parser from scratch.  Therefore the task will have to be open to using other resources.  The participants will be free to use standard resources such as treebanks to train their parsers.  We can also consider restricting the outside training resources (e.g. Penn Treebank only) and the domain of the entailments (e.g. finance only).  The entailment training set can then be used to fine tune the parser by focusing the evaluation on important parser decisions that effect downstream applications.&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-4741823383099032949?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='related' href='http://www.denizyuret.com/research/2009/pete/proposal/pete.pdf' title='Parser Training and Evaluation using Textual Entailments'/><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/4741823383099032949/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=4741823383099032949' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/4741823383099032949'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/4741823383099032949'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2008/09/parser-training-and-evaluation-using.html' title='Parser Training and Evaluation using Textual Entailments'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-4654152392055519332</id><published>2008-08-16T15:52:00.007+03:00</published><updated>2010-01-16T08:24:39.717+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Publications'/><title type='text'>Discriminative vs. Generative Approaches in Semantic Role Labeling</title><content type='html'>Deniz Yuret, Mehmet Ali Yatbaz and Ahmet Engin Ural.  In &lt;i&gt;Proceedings of The Twelfth Conference on Natural Language Learning (CoNLL-2008)&lt;/i&gt; (&lt;a href="http://www.cnts.ua.ac.be/conll2008/pdf/22327.pdf"&gt;PDF&lt;/a&gt;, &lt;a href="http://portal.acm.org/citation.cfm?id=1596324.1596364"&gt;ACM&lt;/a&gt;)&lt;span class="fullpost"&gt;&lt;br /&gt;&lt;br /&gt;Abstract: This paper describes the two algorithms we developed for the CoNLL 2008 Shared Task “Joint learning of syntactic and semantic dependencies”. Both algorithms start parsing the sentence using the same syntactic parser. The first algorithm uses machine learning methods to identify the semantic dependencies in four stages:  identification and labeling of predicates, identification and labeling of arguments. The second algorithm uses a generative probabilistic model, choosing the semantic dependencies that maximize the probability with respect to the model. A hybrid algorithm combining the best stages of the two algorithms attains 86.62% labeled syntactic attachment accuracy, 73.24% labeled semantic dependency F1 and 79.93% labeled macro F1 score for the combined WSJ and Brown test sets.&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-4654152392055519332?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='related' href='http://www.cnts.ua.ac.be/conll2008/pdf/22327.pdf' title='Discriminative vs. Generative Approaches in Semantic Role Labeling'/><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/4654152392055519332/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=4654152392055519332' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/4654152392055519332'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/4654152392055519332'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2008/08/discriminative-vs-generative-approaches.html' title='Discriminative vs. Generative Approaches in Semantic Role Labeling'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-7898661268007048692</id><published>2008-08-01T13:00:00.002+03:00</published><updated>2010-11-03T08:14:40.094+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Students'/><title type='text'>Ahmet Engin Ural, M.S. 2008</title><content type='html'>&lt;b&gt;Evolution of Compositionality with a Bag of Words Syntax&lt;/b&gt;&lt;br /&gt;Ahmet Engin Ural, M.S. Thesis, Koç University Department of Computer Engineering, August 2008.&lt;span class="fullpost"&gt;&lt;br /&gt;&lt;br /&gt;In the last two decades, the idea of an emerging and evolving language has been studied thoroughly. The main question behind this kind of studies is how a group of humans reaches an agreement on the phonology, lexicon and syntax. The improvements in computational tools led the researchers build and test models that have been ran computer simulations to answer the question. Although the models are mere reflections of the reality, the results have been often useful and insightful. This dissertation follows the same line and proposes a new model, tested in a game based simulation methodology. Besides, this work tries to fill the gap in the studies of lexicon compositionality and proposes a plausible explanation for the transition from single word naming to multi word naming. The direction of the results is in line with the previous research such as the emergence of a stable and communicative language. Moreover compositionality in lexicon is observed with a very simple bag of words syntax. The parameters influencing the results are analyzed in depth. Even though the model does not meet the standards of the real world, future work hints insightful facts about the transition from single word naming to syntax.&lt;br /&gt;&lt;br /&gt;&lt;a href="http://aeural.googlepages.com/aeu.v.3.pdf"&gt;Download thesis.&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;b&gt;A selection of Evolution of Language Resources  &lt;/b&gt;&lt;p style="text-align: left;"&gt;Biannual Evolution of Language Conference: &lt;a href="http://stel.ub.edu/evolang2008/"&gt;evolang 2008&lt;/a&gt;&lt;/p&gt;&lt;p style="text-align: left;"&gt;University of Edinburgh, &lt;a href="http://www.ling.ed.ac.uk/lec/LEC/Welcome.html"&gt;Language Evolution and Computation Research Unit&lt;/a&gt; &lt;/p&gt;&lt;p style="text-align: left;"&gt;&lt;a href="http://www.ling.ed.ac.uk/lec/LEC/Welcome.html"&gt;A literature overview&lt;/a&gt; by &lt;a href="http://ftp.ling.ed.ac.uk/%7Esimon/"&gt;Simon Kirby&lt;/a&gt;&lt;/p&gt;&lt;p style="text-align: left;"&gt;The book &lt;a href="http://ftp.ling.ed.ac.uk/%7Esimon/langevol.html"&gt;Language Evolution&lt;/a&gt; by Morten Christiansen and Simon Kirby&lt;/p&gt;&lt;p style="text-align: left;"&gt;A PhD thesis by Joris van Looveren, &lt;a href="http://arti.vub.ac.be/docs/jvanlooveren-phd.pdf"&gt;Design and Performance of Pre-Grammatical Language Games&lt;/a&gt;&lt;/p&gt;&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-7898661268007048692?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='related' href='http://aeural.googlepages.com/msthesis' title='Ahmet Engin Ural, M.S. 2008'/><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/7898661268007048692/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=7898661268007048692' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/7898661268007048692'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/7898661268007048692'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2008/08/ahmet-engin-ural-ms-2008.html' title='Ahmet Engin Ural, M.S. 2008'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-6700295568068023128</id><published>2008-06-15T16:07:00.010+03:00</published><updated>2010-11-03T08:27:27.309+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Publications'/><category scheme='http://www.blogger.com/atom/ns#' term='Downloads'/><title type='text'>Smoothing a Tera-word Language Model</title><content type='html'>Deniz Yuret.  In &lt;i&gt;The 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies (ACL-08: HLT)&lt;/i&gt; (Download &lt;a href="http://aclweb.org/anthology-new/P/P08/P08-2036.pdf"&gt;PDF&lt;/a&gt;)&lt;br /&gt;Download the software used in this study: &lt;a href="http://sites.google.com/a/yuret.com/deniz/2008/06/smoothing-tera-word-language-model/glookup.tgz"&gt;glookup.tgz&lt;/a&gt; reads ngram patterns (possibly containing wildcards) from stdin, finds their counts in one pass from Google Web1T data, and prints the results.  &lt;a href="http://sites.google.com/a/yuret.com/deniz/2008/06/smoothing-tera-word-language-model/glookup.pl"&gt;glookup.pl&lt;/a&gt; quickly searches for a given pattern in uncompressed Google Web1T data.  Use the first one for bulk processing, the second one to get a few counts quickly.&lt;br /&gt;&lt;span class="fullpost"&gt;&lt;br /&gt;&lt;br /&gt;&lt;iframe src='http://docs.google.com/EmbedSlideshow?docid=d2jm3f3_967xbp4cdck' frameborder='0' width='410' height='342'&gt;&lt;/iframe&gt;&lt;br /&gt;&lt;br /&gt;Abstract: Frequency counts from very large corpora, such as the Web 1T dataset, have recently become available for language modeling. Omission of low frequency n-gram counts is a practical necessity for datasets of this size. Naive implementations of standard smoothing methods do not realize the full potential of such large datasets with missing counts. In this paper I present a new smoothing algorithm that combines the Dirichlet prior form of (Mackay and Peto, 1995) with the modified back-off estimates of (Kneser and Ney, 1995) that leads to a 31% perplexity reduction on the Brown corpus compared to a baseline implementation of Kneser-Ney discounting.&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-6700295568068023128?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='related' href='http://aclweb.org/anthology-new/P/P08/P08-2036.pdf' title='Smoothing a Tera-word Language Model'/><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/6700295568068023128/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=6700295568068023128' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/6700295568068023128'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/6700295568068023128'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2008/06/smoothing-tera-word-language-model.html' title='Smoothing a Tera-word Language Model'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-1257444390874997646</id><published>2008-04-24T21:24:00.002+03:00</published><updated>2010-11-03T09:08:54.404+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Türkçe'/><title type='text'>Olasılık ve kehanet</title><content type='html'>Diyelim finans danışmanınız bir gün geldi ve borsanın yönünü&lt;br /&gt;yüksek oranda doğru olarak tahmin edebilen bir algoritma bulduğunu&lt;br /&gt;iddia etti. Algoritma girdi olarak son on günün hareketlerini&lt;br /&gt;(aşağı ya da yukarı) alıyor ve çıktı olarak yarınki yönü (aşağı ya&lt;br /&gt;da yukarı) belirliyor. Danışman son dört yılın datasını (yaklaşık&lt;br /&gt;1000 gün) kullanarak algoritmayı geliştirdiğini ve aynı dönemde&lt;br /&gt;simüle edildiğinde algoritmanın %90'ın üzerinde başarılı olduğunu&lt;br /&gt;açıklıyor.  Paranızı yatırır mısınız?&lt;br /&gt;&lt;br /&gt;İpuçları:&lt;br /&gt;1. Eğer borsanın yönü yazı tura ile belirlense bu algoritmadan nasıl bir performans beklerdiniz?&lt;br /&gt;2. Eğer algoritma ilk iki yıla bakarak geliştirilse ve son iki yıl üzerinde test edildiğinde bu başarıyı gösterse fikriniz değişir miydi?&lt;br /&gt;3. Eğer algoritma son 10 güne değil son 5 güne bakarak bu başarıyı gösterse fikriniz değişir miydi?&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-1257444390874997646?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='related' href='http://tech.groups.yahoo.com/group/ariteknokent/message/1062' title='Olasılık ve kehanet'/><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/1257444390874997646/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=1257444390874997646' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/1257444390874997646'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/1257444390874997646'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2008/04/olaslk-ve-kehanet.html' title='Olasılık ve kehanet'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-3103413736280125999</id><published>2007-12-31T16:39:00.013+02:00</published><updated>2010-11-03T09:08:54.405+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Notes'/><title type='text'>SRILM ngram smoothing notes</title><content type='html'>This is a unix man page I wrote explaining the smoothing methods used in the SRILM statistical language modeling toolkit. &lt;span class="fullpost"&gt;&lt;br /&gt;&lt;br /&gt;&lt;H1&gt;ngram-discount&lt;/H1&gt; &lt;H2&gt; NAME &lt;/H2&gt; ngram-discount - notes on the N-gram smoothing implementations in SRILM &lt;H2&gt; NOTATION &lt;/H2&gt; &lt;DL&gt; &lt;DT&gt;&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;&lt;I&gt;&lt;/I&gt;&lt;I&gt;&lt;/I&gt; &lt;DD&gt; An N-gram where &lt;I&gt; a &lt;/I&gt; is the first word, &lt;I&gt; z &lt;/I&gt; is the last word, and "_" represents 0 or more words in between. &lt;DT&gt;&lt;I&gt;p&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;)&lt;I&gt;&lt;/I&gt; &lt;DD&gt; The estimated conditional probability of the &lt;I&gt;n&lt;/I&gt;th word &lt;I&gt; z &lt;/I&gt; given the first &lt;I&gt;n&lt;/I&gt;-1 words (&lt;I&gt;a&lt;/I&gt;_)&lt;I&gt;&lt;/I&gt;&lt;I&gt;&lt;/I&gt; of an N-gram. &lt;DT&gt;&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;&lt;/I&gt;&lt;I&gt;&lt;/I&gt;&lt;I&gt;&lt;/I&gt; &lt;DD&gt; The &lt;I&gt;n&lt;/I&gt;-1 word prefix of the N-gram &lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;.&lt;I&gt;&lt;/I&gt;&lt;I&gt;&lt;/I&gt; &lt;DT&gt;_&lt;I&gt;z&lt;/I&gt;&lt;I&gt;&lt;/I&gt;&lt;I&gt;&lt;/I&gt; &lt;DD&gt; The &lt;I&gt;n&lt;/I&gt;-1 word suffix of the N-gram &lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;.&lt;I&gt;&lt;/I&gt;&lt;I&gt;&lt;/I&gt; &lt;DT&gt;&lt;I&gt;c&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;)&lt;I&gt;&lt;/I&gt; &lt;DD&gt; The count of N-gram &lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;&lt;I&gt;&lt;/I&gt;&lt;I&gt;&lt;/I&gt; in the training data. &lt;DT&gt;&lt;I&gt;n&lt;/I&gt;(*_&lt;I&gt;z&lt;/I&gt;)&lt;I&gt;&lt;/I&gt;&lt;I&gt;&lt;/I&gt; &lt;DD&gt; The number of unique N-grams that match a given pattern. "(*)" represents a wildcard matching a single word. &lt;DT&gt;&lt;I&gt;n1&lt;/I&gt;,&lt;I&gt;n&lt;/I&gt;[1]&lt;I&gt;&lt;/I&gt;&lt;I&gt;&lt;/I&gt; &lt;DD&gt; The number of unique N-grams with count = 1. &lt;/DD&gt; &lt;/DL&gt; &lt;H2&gt; DESCRIPTION &lt;/H2&gt; &lt;P&gt; N-gram models try to estimate the probability of a word &lt;I&gt; z &lt;/I&gt; in the context of the previous &lt;I&gt;n&lt;/I&gt;-1 words (&lt;I&gt;a&lt;/I&gt;_),&lt;I&gt;&lt;/I&gt;&lt;I&gt;&lt;/I&gt; i.e., &lt;I&gt;Pr&lt;/I&gt;(&lt;I&gt;z&lt;/I&gt;|&lt;I&gt;a&lt;/I&gt;_).&lt;I&gt;&lt;/I&gt; We will denote this conditional probability using &lt;I&gt;p&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;)&lt;I&gt;&lt;/I&gt; for convenience. One way to estimate &lt;I&gt;p&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;)&lt;I&gt;&lt;/I&gt; is to look at the number of times word &lt;I&gt; z &lt;/I&gt; has followed the previous &lt;I&gt;n&lt;/I&gt;-1 words (&lt;I&gt;a&lt;/I&gt;_):&lt;I&gt;&lt;/I&gt;&lt;I&gt;&lt;/I&gt; &lt;PRE&gt;(1) &lt;I&gt;p&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;) = &lt;I&gt;c&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;)/&lt;I&gt;c&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_) &lt;/PRE&gt; This is known as the maximum likelihood (ML) estimate. Unfortunately it does not work very well because it assigns zero probability to N-grams that have not been observed in the training data. To avoid the zero probabilities, we take some probability mass from the observed N-grams and distribute it to unobserved N-grams. Such redistribution is known as smoothing or discounting. &lt;P&gt; Most existing smoothing algorithms can be described by the following equation: &lt;PRE&gt;(2) &lt;I&gt;p&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;) = (&lt;I&gt;c&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;) &amp;gt; 0) ? &lt;I&gt;f&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;) : bow(&lt;I&gt;a&lt;/I&gt;_) &lt;I&gt;p&lt;/I&gt;(_&lt;I&gt;z&lt;/I&gt;) &lt;/PRE&gt; If the N-gram &lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;&lt;I&gt;&lt;/I&gt;&lt;I&gt;&lt;/I&gt; has been observed in the training data, we use the distribution &lt;I&gt;f&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;).&lt;I&gt;&lt;/I&gt; Typically &lt;I&gt;f&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;)&lt;I&gt;&lt;/I&gt; is discounted to be less than the ML estimate so we have some leftover probability for the &lt;I&gt; z &lt;/I&gt; words unseen in the context (&lt;I&gt;a&lt;/I&gt;_).&lt;I&gt;&lt;/I&gt;&lt;I&gt;&lt;/I&gt; Different algorithms mainly differ on how they discount the ML estimate to get &lt;I&gt;f&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;).&lt;I&gt;&lt;/I&gt; &lt;P&gt; If the N-gram &lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;&lt;I&gt;&lt;/I&gt;&lt;I&gt;&lt;/I&gt; has not been observed in the training data, we use the lower order distribution &lt;I&gt;p&lt;/I&gt;(_&lt;I&gt;z&lt;/I&gt;).&lt;I&gt;&lt;/I&gt;&lt;I&gt;&lt;/I&gt; If the context has never been observed (&lt;I&gt;c&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_) = 0), we can use the lower order distribution directly (bow(&lt;I&gt;a&lt;/I&gt;_) = 1). Otherwise we need to compute a backoff weight (bow) to make sure probabilities are normalized: &lt;PRE&gt; Sum_&lt;I&gt;z&lt;/I&gt; &lt;I&gt;p&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;) = 1 &lt;/PRE&gt; &lt;P&gt; Let &lt;I&gt; Z &lt;/I&gt; be the set of all words in the vocabulary, &lt;I&gt; Z0 &lt;/I&gt; be the set of all words with &lt;I&gt;c&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;) = 0, and &lt;I&gt; Z1 &lt;/I&gt; be the set of all words with &lt;I&gt;c&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;) &amp;gt; 0. Given &lt;I&gt;f&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;),&lt;I&gt;&lt;/I&gt; bow(&lt;I&gt;a&lt;/I&gt;_)&lt;I&gt;&lt;/I&gt;&lt;I&gt;&lt;/I&gt; can be determined as follows: &lt;PRE&gt;(3) Sum_&lt;I&gt;Z&lt;/I&gt;  &lt;I&gt;p&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;) = 1&lt;br /&gt; Sum_&lt;I&gt;Z1&lt;/I&gt; &lt;I&gt;f&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;) + Sum_&lt;I&gt;Z0&lt;/I&gt; bow(&lt;I&gt;a&lt;/I&gt;_) &lt;I&gt;p&lt;/I&gt;(_&lt;I&gt;z&lt;/I&gt;) = 1&lt;br /&gt; bow(&lt;I&gt;a&lt;/I&gt;_) = (1 - Sum_&lt;I&gt;Z1&lt;/I&gt; &lt;I&gt;f&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;)) / Sum_&lt;I&gt;Z0&lt;/I&gt; &lt;I&gt;p&lt;/I&gt;(_&lt;I&gt;z&lt;/I&gt;)&lt;br /&gt;         = (1 - Sum_&lt;I&gt;Z1&lt;/I&gt; &lt;I&gt;f&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;)) / (1 - Sum_&lt;I&gt;Z1&lt;/I&gt; &lt;I&gt;p&lt;/I&gt;(_&lt;I&gt;z&lt;/I&gt;))&lt;br /&gt;         = (1 - Sum_&lt;I&gt;Z1&lt;/I&gt; &lt;I&gt;f&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;)) / (1 - Sum_&lt;I&gt;Z1&lt;/I&gt; &lt;I&gt;f&lt;/I&gt;(_&lt;I&gt;z&lt;/I&gt;)) &lt;/PRE&gt; &lt;P&gt; Smoothing is generally done in one of two ways. The backoff models compute &lt;I&gt;p&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;)&lt;I&gt;&lt;/I&gt; based on the N-gram counts &lt;I&gt;c&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;)&lt;I&gt;&lt;/I&gt; when &lt;I&gt;c&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;) &amp;gt; 0, and only consider lower order counts &lt;I&gt;c&lt;/I&gt;(_&lt;I&gt;z&lt;/I&gt;)&lt;I&gt;&lt;/I&gt;&lt;I&gt;&lt;/I&gt; when &lt;I&gt;c&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;) = 0. Interpolated models take lower order counts into account when &lt;I&gt;c&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;) &amp;gt; 0 as well. A common way to express an interpolated model is: &lt;PRE&gt;(4) &lt;I&gt;p&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;) = &lt;I&gt;g&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;) + bow(&lt;I&gt;a&lt;/I&gt;_) &lt;I&gt;p&lt;/I&gt;(_&lt;I&gt;z&lt;/I&gt;) &lt;/PRE&gt; Where &lt;I&gt;g&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;) = 0 when &lt;I&gt;c&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;) = 0 and it is discounted to be less than the ML estimate when &lt;I&gt;c&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;) &amp;gt; 0 to reserve some probability mass for the unseen &lt;I&gt; z &lt;/I&gt; words. Given &lt;I&gt;g&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;),&lt;I&gt;&lt;/I&gt; bow(&lt;I&gt;a&lt;/I&gt;_)&lt;I&gt;&lt;/I&gt;&lt;I&gt;&lt;/I&gt; can be determined as follows: &lt;PRE&gt;(5) Sum_&lt;I&gt;Z&lt;/I&gt;  &lt;I&gt;p(&lt;/I&gt;&lt;I&gt;a_&lt;/I&gt;&lt;I&gt;z)&lt;/I&gt; = 1&lt;br /&gt; Sum_&lt;I&gt;Z1&lt;/I&gt; &lt;I&gt;g(&lt;/I&gt;&lt;I&gt;a_&lt;/I&gt;&lt;I&gt;z&lt;/I&gt;) + Sum_&lt;I&gt;Z&lt;/I&gt; bow(&lt;I&gt;a&lt;/I&gt;_) &lt;I&gt;p&lt;/I&gt;(_&lt;I&gt;z&lt;/I&gt;) = 1&lt;br /&gt; bow(&lt;I&gt;a&lt;/I&gt;_) = 1 - Sum_&lt;I&gt;Z1&lt;/I&gt; &lt;I&gt;g&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;) &lt;/PRE&gt; &lt;P&gt; An interpolated model can also be expressed in the form of equation (2), which is the way it is represented in the ARPA format model files in SRILM: &lt;PRE&gt;(6) &lt;I&gt;f&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;) = &lt;I&gt;g&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;) + bow(&lt;I&gt;a&lt;/I&gt;_) &lt;I&gt;p&lt;/I&gt;(_&lt;I&gt;z&lt;/I&gt;)&lt;br /&gt; &lt;I&gt;p&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;) = (&lt;I&gt;c&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;) &amp;gt; 0) ? &lt;I&gt;f&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;) : bow(&lt;I&gt;a&lt;/I&gt;_) &lt;I&gt;p&lt;/I&gt;(_&lt;I&gt;z&lt;/I&gt;) &lt;/PRE&gt; &lt;P&gt; Most algorithms in SRILM have both backoff and interpolated versions. Empirically, interpolated algorithms usually do better than the backoff ones, and Kneser-Ney does better than others. &lt;H2&gt; OPTIONS &lt;/H2&gt; &lt;P&gt; This section describes the formulation of each discounting option in &lt;A HREF="http://www.speech.sri.com/projects/srilm/manpages/ngram-count.1.html"&gt;ngram-count(1)&lt;/A&gt;. After giving the motivation for each discounting method, we will give expressions for &lt;I&gt;f&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;)&lt;I&gt;&lt;/I&gt; and bow(&lt;I&gt;a&lt;/I&gt;_)&lt;I&gt;&lt;/I&gt;&lt;I&gt;&lt;/I&gt; of Equation 2 in terms of the counts. Note that some counts may not be included in the model file because of the &lt;B&gt; -gtmin &lt;/B&gt; options; see Warning 4 in the next section. &lt;P&gt; Backoff versions are the default but interpolated versions of most models are available using the &lt;B&gt; -interpolate &lt;/B&gt; option. In this case we will express &lt;I&gt;g&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_z&lt;I&gt;)&lt;/I&gt;&lt;I&gt;&lt;/I&gt; and bow(&lt;I&gt;a&lt;/I&gt;_)&lt;I&gt;&lt;/I&gt;&lt;I&gt;&lt;/I&gt; of Equation 4 in terms of the counts as well. Note that the ARPA format model files store the interpolated models and the backoff models the same way using &lt;I&gt;f&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;)&lt;I&gt;&lt;/I&gt; and bow(&lt;I&gt;a&lt;/I&gt;_);&lt;I&gt;&lt;/I&gt;&lt;I&gt;&lt;/I&gt; see Warning 3 in the next section. The conversion between backoff and interpolated formulations is given in Equation 6. &lt;P&gt; The discounting options may be followed by a digit (1-9) to indicate that only specific N-gram orders be affected. See &lt;A HREF="http://www.speech.sri.com/projects/srilm/manpages/ngram-count.1.html"&gt;ngram-count(1)&lt;/A&gt; for more details. &lt;DL&gt; &lt;DT&gt;&lt;B&gt;-cdiscount&lt;/B&gt;&lt;I&gt; D&lt;/I&gt;&lt;B&gt;&lt;/B&gt;&lt;I&gt;&lt;/I&gt;&lt;B&gt;&lt;/B&gt;&lt;I&gt;&lt;/I&gt;&lt;B&gt;&lt;/B&gt; &lt;DD&gt; Ney's absolute discounting using &lt;I&gt; D &lt;/I&gt; as the constant to subtract. &lt;I&gt; D &lt;/I&gt; should be between 0 and 1. If &lt;I&gt; Z1 &lt;/I&gt; is the set of all words &lt;I&gt; z &lt;/I&gt; with &lt;I&gt;c&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;) &amp;gt; 0: &lt;PRE&gt; &lt;I&gt;f&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;)  = (&lt;I&gt;c&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;) - &lt;I&gt;D&lt;/I&gt;) / &lt;I&gt;c&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_)&lt;br /&gt; &lt;I&gt;p&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;)  = (&lt;I&gt;c&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;) &amp;gt; 0) ? &lt;I&gt;f&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;) : bow(&lt;I&gt;a&lt;/I&gt;_) &lt;I&gt;p&lt;/I&gt;(_&lt;I&gt;z&lt;/I&gt;) ;Eq.2&lt;br /&gt; bow(&lt;I&gt;a&lt;/I&gt;_) = (1-Sum_&lt;I&gt;Z1&lt;/I&gt; f(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;)) / (1-Sum_&lt;I&gt;Z1&lt;/I&gt; &lt;I&gt;f&lt;/I&gt;(_&lt;I&gt;z&lt;/I&gt;))  ;Eq.3 &lt;/PRE&gt; With the &lt;B&gt; -interpolate &lt;/B&gt; option we have: &lt;PRE&gt; &lt;I&gt;g&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;)  = max(0, &lt;I&gt;c&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;) - &lt;I&gt;D&lt;/I&gt;) / &lt;I&gt;c&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_)&lt;br /&gt; &lt;I&gt;p&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;)  = &lt;I&gt;g&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;) + bow(&lt;I&gt;a&lt;/I&gt;_) &lt;I&gt;p&lt;/I&gt;(_&lt;I&gt;z&lt;/I&gt;) ;Eq.4&lt;br /&gt; bow(&lt;I&gt;a&lt;/I&gt;_) = 1 - Sum_&lt;I&gt;Z1&lt;/I&gt; &lt;I&gt;g&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;)      ;Eq.5&lt;br /&gt;         = &lt;I&gt;D&lt;/I&gt; &lt;I&gt;n&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_*) / &lt;I&gt;c&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_) &lt;/PRE&gt; The suggested discount factor is: &lt;PRE&gt; &lt;I&gt;D&lt;/I&gt; = &lt;I&gt;n1&lt;/I&gt; / (&lt;I&gt;n1&lt;/I&gt; + 2*&lt;I&gt;n2&lt;/I&gt;) &lt;/PRE&gt; where &lt;I&gt; n1 &lt;/I&gt; and &lt;I&gt; n2 &lt;/I&gt; are the total number of N-grams with exactly one and two counts, respectively. Different discounting constants can be specified for different N-gram orders using options &lt;B&gt;-cdiscount1&lt;/B&gt;,&lt;B&gt;&lt;/B&gt;&lt;B&gt;&lt;/B&gt;&lt;B&gt;&lt;/B&gt; &lt;B&gt;-cdiscount2&lt;/B&gt;,&lt;B&gt;&lt;/B&gt;&lt;B&gt;&lt;/B&gt;&lt;B&gt;&lt;/B&gt; etc. &lt;DT&gt;&lt;B&gt;-kndiscount&lt;/B&gt; and &lt;B&gt;-ukndiscount&lt;/B&gt;&lt;B&gt;&lt;/B&gt;&lt;B&gt;&lt;/B&gt; &lt;DD&gt; Kneser-Ney discounting. This is similar to absolute discounting in that the discounted probability is computed by subtracting a constant &lt;I&gt; D &lt;/I&gt; from the N-gram count. The options &lt;B&gt; -kndiscount &lt;/B&gt; and &lt;B&gt; -ukndiscount &lt;/B&gt; differ as to how this constant is computed. &lt;BR&gt; The main idea of Kneser-Ney is to use a modified probability estimate for lower order N-grams used for backoff. Specifically, the modified probability for a lower order N-gram is taken to be proportional to the number of unique words that precede it in the training data. With discounting and normalization we get: &lt;PRE&gt; &lt;I&gt;f&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;) = (&lt;I&gt;c&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;) - &lt;I&gt;D0&lt;/I&gt;) / &lt;I&gt;c&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_)  ;; for highest order&lt;br /&gt; &lt;I&gt;f&lt;/I&gt;(_&lt;I&gt;z&lt;/I&gt;)  = (&lt;I&gt;n&lt;/I&gt;(*_&lt;I&gt;z&lt;/I&gt;) - &lt;I&gt;D1&lt;/I&gt;) / &lt;I&gt;n&lt;/I&gt;(*_*) ;; for lower orders &lt;/PRE&gt; where the &lt;I&gt;n&lt;/I&gt;(*_&lt;I&gt;z&lt;/I&gt;)&lt;I&gt;&lt;/I&gt;&lt;I&gt;&lt;/I&gt; notation represents the number of unique N-grams that match a given pattern with (*) used as a wildcard for a single word. &lt;I&gt; D0 &lt;/I&gt; and &lt;I&gt; D1 &lt;/I&gt; represent two different discounting constants, as each N-gram order uses a different discounting constant. The resulting conditional probability and the backoff weight is calculated as given in equations (2) and (3): &lt;PRE&gt; &lt;I&gt;p&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;)  = (&lt;I&gt;c&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;) &amp;gt; 0) ? &lt;I&gt;f&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;) : bow(&lt;I&gt;a&lt;/I&gt;_) &lt;I&gt;p&lt;/I&gt;(_&lt;I&gt;z&lt;/I&gt;) ;Eq.2&lt;br /&gt; bow(&lt;I&gt;a&lt;/I&gt;_) = (1-Sum_&lt;I&gt;Z1&lt;/I&gt; f(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;)) / (1-Sum_&lt;I&gt;Z1&lt;/I&gt; &lt;I&gt;f&lt;/I&gt;(_&lt;I&gt;z&lt;/I&gt;))  ;Eq.3 &lt;/PRE&gt; The option &lt;B&gt; -interpolate &lt;/B&gt; is used to create the interpolated versions of &lt;B&gt; -kndiscount &lt;/B&gt; and &lt;B&gt;-ukndiscount&lt;/B&gt;.&lt;B&gt;&lt;/B&gt;&lt;B&gt;&lt;/B&gt;&lt;B&gt;&lt;/B&gt; In this case we have: &lt;PRE&gt; &lt;I&gt;p&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;) = &lt;I&gt;g&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;) + bow(&lt;I&gt;a&lt;/I&gt;_) &lt;I&gt;p&lt;/I&gt;(_&lt;I&gt;z&lt;/I&gt;)  ;Eq.4 &lt;/PRE&gt; Let &lt;I&gt; Z1 &lt;/I&gt; be the set {&lt;I&gt;z&lt;/I&gt;: &lt;I&gt;c&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;) &amp;gt; 0}. For highest order N-grams we have: &lt;PRE&gt; &lt;I&gt;g&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;)  = max(0, &lt;I&gt;c&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;) - &lt;I&gt;D&lt;/I&gt;) / &lt;I&gt;c&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_)&lt;br /&gt; bow(&lt;I&gt;a&lt;/I&gt;_) = 1 - Sum_&lt;I&gt;Z1&lt;/I&gt; &lt;I&gt;g&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;)&lt;br /&gt;         = 1 - Sum_&lt;I&gt;Z1&lt;/I&gt; &lt;I&gt;c&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;) / &lt;I&gt;c&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_) + Sum_&lt;I&gt;Z1&lt;/I&gt; &lt;I&gt;D&lt;/I&gt; / &lt;I&gt;c&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_)&lt;br /&gt;         = &lt;I&gt;D&lt;/I&gt; &lt;I&gt;n&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_*) / &lt;I&gt;c&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_) &lt;/PRE&gt; Let &lt;I&gt; Z2 &lt;/I&gt; be the set {&lt;I&gt;z&lt;/I&gt;: &lt;I&gt;n&lt;/I&gt;(*_&lt;I&gt;z&lt;/I&gt;) &amp;gt; 0}. For lower order N-grams we have: &lt;PRE&gt; &lt;I&gt;g&lt;/I&gt;(_&lt;I&gt;z&lt;/I&gt;)  = max(0, &lt;I&gt;n&lt;/I&gt;(*_&lt;I&gt;z&lt;/I&gt;) - &lt;I&gt;D&lt;/I&gt;) / &lt;I&gt;n&lt;/I&gt;(*_*)&lt;br /&gt; bow(_) = 1 - Sum_&lt;I&gt;Z2&lt;/I&gt; &lt;I&gt;g&lt;/I&gt;(_&lt;I&gt;z&lt;/I&gt;)&lt;br /&gt;        = 1 - Sum_&lt;I&gt;Z2&lt;/I&gt; &lt;I&gt;n&lt;/I&gt;(*_&lt;I&gt;z&lt;/I&gt;) / &lt;I&gt;n&lt;/I&gt;(*_*) + Sum_&lt;I&gt;Z2&lt;/I&gt; &lt;I&gt;D&lt;/I&gt; / &lt;I&gt;n&lt;/I&gt;(*_*)&lt;br /&gt;        = &lt;I&gt;D&lt;/I&gt; &lt;I&gt;n&lt;/I&gt;(_*) / &lt;I&gt;n&lt;/I&gt;(*_*) &lt;/PRE&gt; The original Kneser-Ney discounting (&lt;B&gt;-ukndiscount&lt;/B&gt;)&lt;B&gt;&lt;/B&gt;&lt;B&gt;&lt;/B&gt; uses one discounting constant for each N-gram order. These constants are estimated as &lt;PRE&gt; &lt;I&gt;D&lt;/I&gt; = &lt;I&gt;n1&lt;/I&gt; / (&lt;I&gt;n1&lt;/I&gt; + 2*&lt;I&gt;n2&lt;/I&gt;) &lt;/PRE&gt; where &lt;I&gt; n1 &lt;/I&gt; and &lt;I&gt; n2 &lt;/I&gt; are the total number of N-grams with exactly one and two counts, respectively. &lt;BR&gt; Chen and Goodman's modified Kneser-Ney discounting (&lt;B&gt;-kndiscount&lt;/B&gt;)&lt;B&gt;&lt;/B&gt;&lt;B&gt;&lt;/B&gt; uses three discounting constants for each N-gram order, one for one-count N-grams, one for two-count N-grams, and one for three-plus-count N-grams: &lt;PRE&gt; &lt;I&gt;Y&lt;/I&gt;   = &lt;I&gt;n1&lt;/I&gt;/(&lt;I&gt;n1&lt;/I&gt;+2*&lt;I&gt;n2&lt;/I&gt;)&lt;br /&gt; &lt;I&gt;D1&lt;/I&gt;  = 1 - 2&lt;I&gt;Y&lt;/I&gt;(&lt;I&gt;n2&lt;/I&gt;/&lt;I&gt;n1&lt;/I&gt;)&lt;br /&gt; &lt;I&gt;D2&lt;/I&gt;  = 2 - 3&lt;I&gt;Y&lt;/I&gt;(&lt;I&gt;n3&lt;/I&gt;/&lt;I&gt;n2&lt;/I&gt;)&lt;br /&gt; &lt;I&gt;D3+&lt;/I&gt; = 3 - 4&lt;I&gt;Y&lt;/I&gt;(&lt;I&gt;n4&lt;/I&gt;/&lt;I&gt;n3&lt;/I&gt;) &lt;/PRE&gt; &lt;DT&gt;&lt;B&gt; Warning: &lt;/B&gt; &lt;DD&gt; SRILM implements Kneser-Ney discounting by actually modifying the counts of the lower order N-grams.  Thus, when the &lt;B&gt; -write &lt;/B&gt; option is used to write the counts with &lt;B&gt; -kndiscount &lt;/B&gt; or &lt;B&gt;-ukndiscount&lt;/B&gt;,&lt;B&gt;&lt;/B&gt;&lt;B&gt;&lt;/B&gt;&lt;B&gt;&lt;/B&gt; only the highest order N-grams and N-grams that start with &amp;lt;s&amp;gt; will have their regular counts &lt;I&gt;c&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;),&lt;I&gt;&lt;/I&gt; all others will have the modified counts &lt;I&gt;n&lt;/I&gt;(*_&lt;I&gt;z&lt;/I&gt;)&lt;I&gt;&lt;/I&gt;&lt;I&gt;&lt;/I&gt; instead. See Warning 2 in the next section. &lt;DT&gt;&lt;B&gt; -wbdiscount &lt;/B&gt; &lt;DD&gt; Witten-Bell discounting. The intuition is that the weight given to the lower order model should be proportional to the probability of observing an unseen word in the current context (&lt;I&gt;a&lt;/I&gt;_).&lt;I&gt;&lt;/I&gt;&lt;I&gt;&lt;/I&gt; Witten-Bell computes this weight as: &lt;PRE&gt; bow(&lt;I&gt;a&lt;/I&gt;_) = &lt;I&gt;n&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_*) / (&lt;I&gt;n&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_*) + &lt;I&gt;c&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_)) &lt;/PRE&gt; Here &lt;I&gt;n&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_*)&lt;I&gt;&lt;/I&gt;&lt;I&gt;&lt;/I&gt; represents the number of unique words following the context (&lt;I&gt;a&lt;/I&gt;_)&lt;I&gt;&lt;/I&gt;&lt;I&gt;&lt;/I&gt; in the training data. Witten-Bell is originally an interpolated discounting method. So with the &lt;B&gt; -interpolate &lt;/B&gt; option we get: &lt;PRE&gt; &lt;I&gt;g&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;) = &lt;I&gt;c&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;) / (&lt;I&gt;n&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_*) + &lt;I&gt;c&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_))&lt;br /&gt; &lt;I&gt;p&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;) = &lt;I&gt;g&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;) + bow(&lt;I&gt;a&lt;/I&gt;_) &lt;I&gt;p&lt;/I&gt;(_&lt;I&gt;z&lt;/I&gt;)   ;Eq.4 &lt;/PRE&gt; Without the &lt;B&gt; -interpolate &lt;/B&gt; option we have the backoff version which is implemented by taking &lt;I&gt;f&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;)&lt;I&gt;&lt;/I&gt; to be the same as the interpolated &lt;I&gt;g&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;).&lt;I&gt;&lt;/I&gt; &lt;PRE&gt; &lt;I&gt;f&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;)  = &lt;I&gt;c&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;) / (&lt;I&gt;n&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_*) + &lt;I&gt;c&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_))&lt;br /&gt; &lt;I&gt;p&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;)  = (&lt;I&gt;c&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;) &amp;gt; 0) ? &lt;I&gt;f&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;) : bow(&lt;I&gt;a&lt;/I&gt;_) &lt;I&gt;p&lt;/I&gt;(_&lt;I&gt;z&lt;/I&gt;) ;Eq.2&lt;br /&gt; bow(&lt;I&gt;a&lt;/I&gt;_) = (1-Sum_&lt;I&gt;Z1&lt;/I&gt; &lt;I&gt;f&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;)) / (1-Sum_&lt;I&gt;Z1&lt;/I&gt; &lt;I&gt;f&lt;/I&gt;(_&lt;I&gt;z&lt;/I&gt;))  ;Eq.3 &lt;/PRE&gt; &lt;DT&gt;&lt;B&gt; -ndiscount &lt;/B&gt; &lt;DD&gt; Ristad's natural discounting law. See Ristad's technical report "A natural law of succession" for a justification of the discounting factor. The &lt;B&gt; -interpolate &lt;/B&gt; option has no effect, only a backoff version has been implemented. &lt;PRE&gt;         &lt;I&gt;c&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;)  &lt;I&gt;c&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_) (&lt;I&gt;c&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_) + 1) + &lt;I&gt;n&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_*) (1 - &lt;I&gt;n&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_*))&lt;br /&gt; &lt;I&gt;f&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;)= ------  ---------------------------------------&lt;br /&gt;         &lt;I&gt;c&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_)        &lt;I&gt;c&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_)^2 + &lt;I&gt;c&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_) + 2 &lt;I&gt;n&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_*)&lt;br /&gt;&lt;br /&gt; &lt;I&gt;p&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;)  = (&lt;I&gt;c&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;) &amp;gt; 0) ? &lt;I&gt;f&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;) : bow(&lt;I&gt;a&lt;/I&gt;_) &lt;I&gt;p&lt;/I&gt;(_&lt;I&gt;z&lt;/I&gt;) ;Eq.2&lt;br /&gt; bow(&lt;I&gt;a&lt;/I&gt;_) = (1-Sum_&lt;I&gt;Z1&lt;/I&gt; f(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;)) / (1-Sum_&lt;I&gt;Z1&lt;/I&gt; &lt;I&gt;f&lt;/I&gt;(_&lt;I&gt;z&lt;/I&gt;))  ;Eq.3 &lt;/PRE&gt; &lt;DT&gt;&lt;B&gt; -count-lm &lt;/B&gt; &lt;DD&gt; Estimate a count-based interpolated LM using Jelinek-Mercer smoothing (Chen &amp;amp; Goodman, 1998), also known as "deleted interpolation." Note that this does not produce a backoff model; instead of  count-LM parameter file in the format described in  &lt;A HREF="http://www.speech.sri.com/projects/srilm/manpages/ngram.1.html"&gt;ngram(1)&lt;/A&gt; needs to be specified using &lt;B&gt;-init-lm&lt;/B&gt;,&lt;B&gt;&lt;/B&gt;&lt;B&gt;&lt;/B&gt;&lt;B&gt;&lt;/B&gt; and a reestimated file in the same format is produced. In the process, the mixture weights that interpolate the ML estimates at all levels of N-grams are estimated using an expectation-maximization (EM) algorithm. The options &lt;B&gt; -em-iters &lt;/B&gt; and &lt;B&gt; -em-delta &lt;/B&gt; control termination of the EM algorithm. Note that the N-gram counts used to estimate the maximum-likelihood estimates are specified in the  &lt;B&gt; -init-lm &lt;/B&gt; model file. The counts specified with &lt;B&gt; -read &lt;/B&gt; or &lt;B&gt; -text &lt;/B&gt; are used only to estimate the interpolation weights. &lt;DT&gt;&lt;B&gt;-addsmooth&lt;/B&gt;&lt;I&gt; D&lt;/I&gt;&lt;B&gt;&lt;/B&gt;&lt;I&gt;&lt;/I&gt;&lt;B&gt;&lt;/B&gt;&lt;I&gt;&lt;/I&gt;&lt;B&gt;&lt;/B&gt; &lt;DD&gt; Smooth by adding  &lt;I&gt; D &lt;/I&gt; to each N-gram count. This is usually a poor smoothing method, included mainly for instructional purposes. &lt;PRE&gt; &lt;I&gt;p&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;) = (&lt;I&gt;c&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;) + &lt;I&gt;D&lt;/I&gt;) / (&lt;I&gt;c&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_) + &lt;I&gt;D&lt;/I&gt; &lt;I&gt;n&lt;/I&gt;(*)) &lt;/PRE&gt; &lt;DT&gt;default &lt;DD&gt; If the user does not specify any discounting options, &lt;B&gt; ngram-count &lt;/B&gt; uses Good-Turing discounting (aka Katz smoothing) by default. The Good-Turing estimate states that for any N-gram that occurs &lt;I&gt; r &lt;/I&gt; times, we should pretend that it occurs &lt;I&gt;r&lt;/I&gt;'&lt;I&gt;&lt;/I&gt;&lt;I&gt;&lt;/I&gt;&lt;I&gt;&lt;/I&gt; times where &lt;PRE&gt; &lt;I&gt;r&lt;/I&gt;' = (&lt;I&gt;r&lt;/I&gt;+1) &lt;I&gt;n&lt;/I&gt;[&lt;I&gt;r&lt;/I&gt;+1]/&lt;I&gt;n&lt;/I&gt;[&lt;I&gt;r&lt;/I&gt;] &lt;/PRE&gt; Here &lt;I&gt;n&lt;/I&gt;[&lt;I&gt;r&lt;/I&gt;]&lt;I&gt;&lt;/I&gt;&lt;I&gt;&lt;/I&gt; is the number of N-grams that occur exactly &lt;I&gt; r &lt;/I&gt; times in the training data.   &lt;BR&gt; Large counts are taken to be reliable, thus they are not subject to any discounting. By default unigram counts larger than 1 and other N-gram counts larger than 7 are taken to be reliable and maximum likelihood estimates are used. These limits can be modified using the &lt;B&gt;-gt&lt;/B&gt;&lt;I&gt;n&lt;/I&gt;&lt;B&gt;max&lt;/B&gt;&lt;I&gt;&lt;/I&gt;&lt;B&gt;&lt;/B&gt;&lt;I&gt;&lt;/I&gt;&lt;B&gt;&lt;/B&gt; options. &lt;PRE&gt; &lt;I&gt;f&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;) = (&lt;I&gt;c&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;) / &lt;I&gt;c&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_))  if &lt;I&gt;c&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;) &amp;gt; &lt;I&gt;gtmax&lt;/I&gt; &lt;/PRE&gt; The lower counts are discounted proportional to the Good-Turing estimate with a small correction &lt;I&gt; A &lt;/I&gt; to account for the high-count N-grams not being discounted. If 1 &amp;lt;= &lt;I&gt;c&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;) &amp;lt;= &lt;I&gt;gtmax&lt;/I&gt;: &lt;PRE&gt;                   &lt;I&gt;n&lt;/I&gt;[&lt;I&gt;gtmax&lt;/I&gt; + 1]&lt;br /&gt;  &lt;I&gt;A&lt;/I&gt; = (&lt;I&gt;gtmax&lt;/I&gt; + 1) --------------&lt;br /&gt;                      &lt;I&gt;n&lt;/I&gt;[1]&lt;br /&gt;&lt;br /&gt;                          &lt;I&gt;n&lt;/I&gt;[&lt;I&gt;c&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;) + 1]&lt;br /&gt;  &lt;I&gt;c&lt;/I&gt;'(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;) = (&lt;I&gt;c&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;) + 1) ---------------&lt;br /&gt;                            &lt;I&gt;n&lt;/I&gt;[&lt;I&gt;c&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;)]&lt;br /&gt;&lt;br /&gt;            &lt;I&gt;c&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;)   (&lt;I&gt;c&lt;/I&gt;'(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;) / &lt;I&gt;c&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;) - &lt;I&gt;A&lt;/I&gt;)&lt;br /&gt;  &lt;I&gt;f&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;) = --------  ----------------------&lt;br /&gt;             &lt;I&gt;c&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_)         (1 - &lt;I&gt;A&lt;/I&gt;) &lt;/PRE&gt; The &lt;B&gt; -interpolate &lt;/B&gt; option has no effect in this case, only a backoff version has been implemented, thus: &lt;PRE&gt; &lt;I&gt;p&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;)  = (&lt;I&gt;c&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;) &amp;gt; 0) ? &lt;I&gt;f&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;) : bow(&lt;I&gt;a&lt;/I&gt;_) &lt;I&gt;p&lt;/I&gt;(_&lt;I&gt;z&lt;/I&gt;)  ;Eq.2&lt;br /&gt; bow(&lt;I&gt;a&lt;/I&gt;_) = (1-Sum_&lt;I&gt;Z1&lt;/I&gt; &lt;I&gt;f&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;)) / (1-Sum_&lt;I&gt;Z1&lt;/I&gt; &lt;I&gt;f&lt;/I&gt;(_&lt;I&gt;z&lt;/I&gt;))   ;Eq.3 &lt;/PRE&gt; &lt;/DD&gt; &lt;/DL&gt; &lt;H2&gt; FILE FORMATS &lt;/H2&gt; SRILM can generate simple N-gram counts from plain text files with the following command: &lt;PRE&gt; ngram-count -order &lt;I&gt;N&lt;/I&gt; -text &lt;I&gt;file.txt&lt;/I&gt; -write &lt;I&gt;file.cnt&lt;/I&gt; &lt;/PRE&gt; The &lt;B&gt; -order &lt;/B&gt; option determines the maximum length of the N-grams. The file &lt;I&gt; file.txt &lt;/I&gt; should contain one sentence per line with tokens separated by whitespace. The output &lt;I&gt; file.cnt &lt;/I&gt; contains the N-gram tokens followed by a tab and a count on each line: &lt;PRE&gt; &lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt; &amp;lt;tab&amp;gt; &lt;I&gt;c&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;) &lt;/PRE&gt; A couple of warnings: &lt;DL&gt; &lt;DT&gt;&lt;B&gt; Warning 1 &lt;/B&gt; &lt;DD&gt; SRILM implicitly assumes an &amp;lt;s&amp;gt; token in the beginning of each line and an &amp;lt;/s&amp;gt; token at the end of each line and counts N-grams that start with &amp;lt;s&amp;gt; and end with &amp;lt;/s&amp;gt;. You do not need to include these tags in &lt;I&gt;file.txt&lt;/I&gt;.&lt;I&gt;&lt;/I&gt;&lt;I&gt;&lt;/I&gt;&lt;I&gt;&lt;/I&gt; &lt;DT&gt;&lt;B&gt; Warning 2 &lt;/B&gt; &lt;DD&gt; When &lt;B&gt; -kndiscount &lt;/B&gt; or &lt;B&gt; -ukndiscount &lt;/B&gt; options are used, the count file contains modified counts. Specifically, all N-grams of the maximum order, and all N-grams that start with &amp;lt;s&amp;gt; have their regular counts &lt;I&gt;c&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;),&lt;I&gt;&lt;/I&gt; but shorter N-grams that do not start with &amp;lt;s&amp;gt; have the number of unique words preceding them &lt;I&gt;n&lt;/I&gt;(*&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;)&lt;I&gt;&lt;/I&gt; instead. See the description of &lt;B&gt; -kndiscount &lt;/B&gt; and &lt;B&gt; -ukndiscount &lt;/B&gt; for details. &lt;/DD&gt; &lt;/DL&gt; &lt;P&gt; For most smoothing methods (except &lt;B&gt;-count-lm&lt;/B&gt;)&lt;B&gt;&lt;/B&gt;&lt;B&gt;&lt;/B&gt;&lt;B&gt;&lt;/B&gt; SRILM generates and uses N-gram model files in the ARPA format. A typical command to generate a model file would be: &lt;PRE&gt; ngram-count -order &lt;I&gt;N&lt;/I&gt; -text &lt;I&gt;file.txt&lt;/I&gt; -lm &lt;I&gt;file.lm&lt;/I&gt; &lt;/PRE&gt; The ARPA format output &lt;I&gt; file.lm &lt;/I&gt; will contain the following information about an N-gram on each line: &lt;PRE&gt; log10(&lt;I&gt;f&lt;/I&gt;(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;)) &amp;lt;tab&amp;gt; &lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt; &amp;lt;tab&amp;gt; log10(bow(&lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;)) &lt;/PRE&gt; Based on Equation 2, the first entry represents the base 10 logarithm of the conditional probability (logprob) for the N-gram &lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;.&lt;I&gt;&lt;/I&gt;&lt;I&gt;&lt;/I&gt; This is followed by the actual words in the N-gram separated by spaces. The last and optional entry is the base-10 logarithm of the backoff weight for (&lt;I&gt;n&lt;/I&gt;+1)-grams starting with &lt;I&gt;a&lt;/I&gt;_&lt;I&gt;z&lt;/I&gt;.&lt;I&gt;&lt;/I&gt;&lt;I&gt;&lt;/I&gt; &lt;DL&gt; &lt;DT&gt;&lt;B&gt; Warning 3 &lt;/B&gt; &lt;DD&gt; Both backoff and interpolated models are represented in the same format. This means interpolation is done during model building and represented in the ARPA format with logprob and backoff weight using equation (6). &lt;DT&gt;&lt;B&gt; Warning 4 &lt;/B&gt; &lt;DD&gt; Not all N-grams in the count file necessarily end up in the model file. The options &lt;B&gt;-gtmin&lt;/B&gt;,&lt;B&gt;&lt;/B&gt;&lt;B&gt;&lt;/B&gt;&lt;B&gt;&lt;/B&gt; &lt;B&gt;-gt1min&lt;/B&gt;,&lt;B&gt;&lt;/B&gt;&lt;B&gt;&lt;/B&gt;&lt;B&gt;&lt;/B&gt; ..., &lt;B&gt; -gt9min &lt;/B&gt; specify the minimum counts for N-grams to be included in the LM (not only for Good-Turing discounting but for the other methods as well). By default all unigrams and bigrams are included, but for higher order N-grams only those with count &amp;gt;= 2 are included. Some exceptions arise, because if one N-gram is included in the model file, all its prefix N-grams have to be included as well. This causes some higher order 1-count N-grams to be included when using KN discounting, which uses modified counts as described in Warning 2. &lt;DT&gt;&lt;B&gt; Warning 5 &lt;/B&gt; &lt;DD&gt; Not all N-grams in the model file have backoff weights. The highest order N-grams do not need a backoff weight. For lower order N-grams backoff weights are only recorded for those that appear as the prefix of a longer N-gram included in the model. For other lower order N-grams the backoff weight is implicitly 1 (or 0, in log representation). &lt;/DD&gt; &lt;/DL&gt; &lt;H2&gt; SEE ALSO &lt;/H2&gt; &lt;A HREF="http://www.speech.sri.com/projects/srilm/manpages/ngram.1.html"&gt;ngram(1)&lt;/A&gt;, &lt;A HREF="http://www.speech.sri.com/projects/srilm/manpages/ngram-count.1.html"&gt;ngram-count(1)&lt;/A&gt;, &lt;A HREF="http://www.speech.sri.com/projects/srilm/manpages/ngram-format.5.html"&gt;ngram-format(5)&lt;/A&gt;, &lt;BR&gt; S. F. Chen and J. Goodman, ``An Empirical Study of Smoothing Techniques for Language Modeling,'' TR-10-98, Computer Science Group, Harvard Univ., 1998. &lt;H2&gt; BUGS &lt;/H2&gt; Work in progress. &lt;H2&gt; AUTHOR &lt;/H2&gt; Deniz Yuret &amp;lt;dyuret@ku.edu.tr&amp;gt; &lt;BR&gt; Andreas Stolcke &amp;lt;stolcke@speech.sri.com&amp;gt; &lt;BR&gt; Copyright 2007 SRI International &lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-3103413736280125999?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='related' href='http://www.speech.sri.com/projects/srilm/' title='SRILM ngram smoothing notes'/><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/3103413736280125999/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=3103413736280125999' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/3103413736280125999'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/3103413736280125999'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2007/12/srilm-ngram-smoothing-notes.html' title='SRILM ngram smoothing notes'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-5449313061808047029</id><published>2007-12-16T15:42:00.002+02:00</published><updated>2010-11-03T09:08:54.406+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Türkçe'/><title type='text'>Verimli Düşünce Alışkanlıkları</title><content type='html'>&lt;i&gt;TÜSİAD'ın Eleştirel Düşünce Üzerine Raporundan&lt;/i&gt;&lt;br /&gt;Özet:                                                                                                                                                        Belirsiz bir gelecek karşısında doğru kararlar alabilmek için&lt;br /&gt;elimizdeki bilgilerden uygun sonuçlara ulaşabilmemiz gerekir.&lt;br /&gt;Bunun için önemli problemleri ilk başta keşfetmeli ve öncelik&lt;br /&gt;sırasına koyabilmeli, yaratıcı çözümler üretebilmeliyiz.  Bu&lt;br /&gt;çözümler arasında seçim yapabilmek için arkalarındaki varsayımları&lt;br /&gt;ve getirdikleri sonuçları görerek doğruluklarını tespit edebilmeli&lt;br /&gt;ve farklı çözümlerin destekledikleri çelişen değerler arasında&lt;br /&gt;denge kurabilmeliyiz.  Doğru düşünme konusunda bir eğitim programı&lt;br /&gt;planlarken hedef belirlemek, tam olarak ne istediğimizi anlamak&lt;br /&gt;önemlidir düşüncesindeyim.  Mantık, bilim, hukuk gibi disiplinler&lt;br /&gt;fikirleri değerlendirme konusunda kendilerine özgü düşünce&lt;br /&gt;yöntemleri geliştirmiştir.  Bu raporda amacım farklı disiplinlerin&lt;br /&gt;bize kazandırdıkları bu yöntemleri örneklendirmek, bu örneklerle&lt;br /&gt;geliştirilebilecek eğitim programları için değerlendirme&lt;br /&gt;kriterleri oluşturmaktır.  Raporun sonundaki ekte ise düşünce&lt;br /&gt;eğitimi konusunda dünyadan bulabildiğim uygulamaları&lt;br /&gt;özetleyeceğim.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-5449313061808047029?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='related' href='http://www.denizyuret.com/research/2007/criticalthinking/criticalthinking.pdf' title='Verimli Düşünce Alışkanlıkları'/><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/5449313061808047029/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=5449313061808047029' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/5449313061808047029'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/5449313061808047029'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2007/12/verimli-dusunce-alskanlklar.html' title='Verimli Düşünce Alışkanlıkları'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-8293312343391739529</id><published>2007-11-03T21:18:00.002+02:00</published><updated>2010-11-03T09:08:54.407+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Türkçe'/><title type='text'>DARPA Urban Challenge</title><content type='html'>Bir süredir heyecanla takip ettiğim bir yarışmadan bahsetmek&lt;br /&gt;istedim. DARPA uzun zamandır kendi kendine şoförsüz gidebilen&lt;br /&gt;arabalar geliştirmek istiyor: &lt;a href="http://www.darpa.mil/grandchallenge"&gt;http://www.darpa.mil/grandchallenge&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;Bu amaçla 2004 ve 2005'de iki yarışma düzenlediler. İlk yarışmada&lt;br /&gt;150 millik çöl parkurunu sağ salim bitirebilen bir araç çıkmadı.&lt;br /&gt;İkinci yarışmada ise 4-5 araba verilen zaman süresi içerisinde&lt;br /&gt;parkuru tamamladılar. Birinci olan Stanford takımından Sebastian&lt;br /&gt;Thrun'un konuşması izlemeye değer: &lt;span class="fullpost"&gt;&lt;br /&gt;&lt;a href="http://video.google.com/videoplay?docid=8594517128412883394&amp;amp;q=engedu"&gt;&lt;br /&gt;http://video.google.com/videoplay?docid=8594517128412883394&amp;amp;q=engedu&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;Bir sene içinde Stanford ve CMU'nun gösterdiği gelişmeyi görünce&lt;br /&gt;ben 2020'lerde ise giderken araba kullanmayacağımıza kani oldum.&lt;br /&gt;&lt;br /&gt;Bu sene üçüncü yarışma düzenleniyor. Bu sefer çölde değil şehir&lt;br /&gt;içinde trafikte yarışacak katılanlar. MİT ilk kez katılıyor.&lt;br /&gt;Yarışma bugün 8am'de (California time). Elemelerden görüntüler&lt;br /&gt;için aşağıdaki siteyi tavsiye ederim:&lt;br /&gt;&lt;a href="http://www.tgdaily.com/content/view/34686/113/"&gt;&lt;br /&gt;http://www.tgdaily.com/content/view/34686/113/&lt;/a&gt;&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-8293312343391739529?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='related' href='http://tech.groups.yahoo.com/group/ariteknokent/message/1033' title='DARPA Urban Challenge'/><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/8293312343391739529/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=8293312343391739529' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/8293312343391739529'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/8293312343391739529'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2007/11/darpa-urban-challenge.html' title='DARPA Urban Challenge'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-215599357590436426</id><published>2007-10-21T21:11:00.000+03:00</published><updated>2010-11-03T09:08:54.407+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Türkçe'/><title type='text'>King Kong ve Fizik</title><content type='html'>Hollywood filmlerinde görsel efektler için küçük maketler sıkça&lt;br /&gt;kullanılır. Örneğin bir evin patlamasını çekmek için gerçek bir&lt;br /&gt;evi parçalamak yerine küçük bir maket patlatılıp sonra film&lt;br /&gt;yavaşlatıldığında doğru izlenimi verir. &lt;span class="fullpost"&gt;&lt;br /&gt;&lt;br /&gt;Gelgelelim içerisinde su olan sahnelerde bu numara çalışmıyor.&lt;br /&gt;Water world filmi gelmiş geçmiş en çok para kaybeden filmlerden&lt;br /&gt;biri oldu bu yüzden, çünkü sahnelerin çoğu gerçek boyutta şu&lt;br /&gt;üzerinde çekilmek zorunda idi. Bir küvete küçük kayıklar koyup&lt;br /&gt;biraz suyu çalkalayın.  Filme çekip sonra yavaşlatın.  Dalgaların&lt;br /&gt;ve hareketlerin çok gerçekçi görünmediklerini farkedeceksiniz.&lt;br /&gt;Emrah'ın bahsettiği transformasyonlar nokta kütlelerden oluşan ve&lt;br /&gt;sadece yerçekiminin etkili olduğu sistemler için 100% çalışıyor,&lt;br /&gt;ama suyun akışkanlığı işi bozuyor.&lt;br /&gt;&lt;br /&gt;Bu örneği öğrendiğimden beri bu tip simetrilerin korunduğu ya da&lt;br /&gt;bozulduğu örnekler hep ilgimi çekmekte. Geçenlerde canlıların&lt;br /&gt;boyutlarıyla ilgili çok güzel iki yazı keşfettim:&lt;br /&gt;&lt;br /&gt;1. Favori bilim adamlarımdan JBS Haldane'nin hayvanların boyutlarının getirdiği fiziksel sınırlamalarla ilgili &lt;a href="http://www.physlink.com/Education/essay_haldane.cfm"&gt;bir yazısı&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;2. Bu fiziksel sonuçların ucuz hollywood sci-fi filmlerindeki devlere ve cücelere &lt;a href="http://fathom.lib.uchicago.edu/2/21701757"&gt;uygulanması&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;Özetle canlıları düşünürken temel kavram şu: bir yaratığın boyunu&lt;br /&gt;orantılı olarak iki katına çıkardığınızda, yüzey alanları dört&lt;br /&gt;katına, hacim ve kütleler sekiz katına çıkıyor. Bunun ilginç&lt;br /&gt;etkileri var.  Örneğin büyük hayvanlar yüksek yerlerden&lt;br /&gt;bırakıldıklarında daha hızlı düşüyorlar (Aristo'ya geri mi döndük&lt;br /&gt;ne :) Sebebi yerçekiminin uyguladığı kuvvetin kütle ile ama hava&lt;br /&gt;direncinin alan ile orantılı olması. Dolayısıyla yüksekten atlayan&lt;br /&gt;bir insan olurken, bir fareye hiç birşey olmuyor (terminal&lt;br /&gt;velocity 10km/h gibi). Bir at ise çorbaya dönüyor - eskiden&lt;br /&gt;bizanslılar katapultlarla hastalıklı atları kale duvarlarından&lt;br /&gt;içeriye fırlatıp hastalık yayarlarmış işgal sırasında. Filleri&lt;br /&gt;hayvanat bahçelerinde hapis tutan etraflarındaki zayıf çitler&lt;br /&gt;değil (onlar insanları dışarıda tutmak için), kazılmış olan 1m'lik&lt;br /&gt;hendek (fil oraya düşerse hayatta kalamayacağını biliyor).&lt;br /&gt;&lt;br /&gt;Diğer örnekleri kızım Asya'da (20 aylık) gözleyebiliyorum. Çok&lt;br /&gt;üşüyor (enerji üretimi kütle ile, işi kaybı ise deri yüzeyiyle&lt;br /&gt;doğru orantılı), dolayısıyla çok yiyor (orantılı olarak benim&lt;br /&gt;günde 15-20 lt süt içmem gibi), ve ağırlığına göre çok güçlü (kas&lt;br /&gt;ve kemik gücü kesit yüzey alanlarıyla, ağırlık ise hacimle&lt;br /&gt;orantılı).&lt;br /&gt;&lt;br /&gt;Bu basit ilişkileri kullanarak King Kong'un niye mümkün olmadığını&lt;br /&gt;(ilk zıpladığında tüm kemikleri kırılırdı), Brontosaurus'un niye&lt;br /&gt;kafasını yere yakın tutmak zorunda olduğunu (yukarıya uzatsa bacak&lt;br /&gt;damarları sıvı basıncına dayanamayıp patlar, beynine kan çıkmadığı&lt;br /&gt;için bayılırdı), dev böceklerin niye var olmadığını (akciğerleri&lt;br /&gt;olmadığı için hava almaları özmosa bağlı, oksijen o kadar derine&lt;br /&gt;penetre etmiyor), insanların niye sihirli ışınlarla mikroskopik&lt;br /&gt;boyutlara küçültülemeyeceğini (brownian motion, ışığın dalga boyu,&lt;br /&gt;enerji üretimi vs vs) açıklayabiliyoruz.&lt;br /&gt;&lt;br /&gt;Ne dersiniz, fizik 101 finalinde sorayım mı bunları bizim&lt;br /&gt;öğrencilere? ;)&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-215599357590436426?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='related' href='http://tech.groups.yahoo.com/group/ariteknokent/message/1020' title='King Kong ve Fizik'/><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/215599357590436426/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=215599357590436426' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/215599357590436426'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/215599357590436426'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2007/10/king-kong-ve-fizik.html' title='King Kong ve Fizik'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-4869325882277743167</id><published>2007-10-21T21:09:00.000+03:00</published><updated>2010-11-03T09:08:54.408+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Türkçe'/><title type='text'>Simetri üzerine</title><content type='html'>Simetri üzerine &lt;a href="http://tech.groups.yahoo.com/group/ariteknokent/message/1006"&gt;Emrah'ın bir yazısının&lt;/a&gt; düşündürdüğü bazı fikirler: &lt;span class="fullpost"&gt;&lt;br /&gt;&lt;br /&gt;1. Enerji, momentum ve açısal momentumun korunumunu zaman ve mekan&lt;br /&gt;simetrileriyle açıklama konusu: bunu hep fizikçi arkadaşlardan&lt;br /&gt;duyarım ama bir türlü derivasyonunu göremedim, bunun hepimizin&lt;br /&gt;anlayacağı basit bir derivasyonu var mı? Varsa niye liselerde bunu&lt;br /&gt;öğretmezler?&lt;br /&gt;&lt;br /&gt;2. Simetri kavramı matematik ve fizikte başlı başına bir eğitim&lt;br /&gt;konusu olabilir. Burada sadece not olarak simetrinin bu bağlamda&lt;br /&gt;günlük değil genelleştirilmiş anlamıyla, yani "yapılan bir&lt;br /&gt;değişiklikten sonra ilgilendiğimiz şeylerin aynı kalması" olarak&lt;br /&gt;kullanıldığını not düşmek isterim. Ama hala "simetri kaybı" gibi&lt;br /&gt;bazı terimlerin tam ne anlama geldiğini tam anlamış değilim.&lt;br /&gt;&lt;br /&gt;3. Bilimin başarısı lokal deneylerin bize çok uzak ve çok farklı&lt;br /&gt;ortamlarla ilgili geçerli bilgiler vermesine dayalı diyoruz.&lt;br /&gt;Psikoloji ve sosyolojinin başarısızlığının sebebini burada&lt;br /&gt;arayabilir miyiz? Örneğin bir insan, bir ülke, ya da bir tarih&lt;br /&gt;için gözlediğimiz şartlar başka bir insan, ülke, veya tarihte aynı&lt;br /&gt;sonucu vermiyorlar genelde (tabi "şartlar"ı yeterince ayrıntılı&lt;br /&gt;almamamızdan olabilir).  Dolayısıyla ortada basit bir insan, yer,&lt;br /&gt;ve zaman simetrisi yok gibi - bu da bilimin elindeki en güçlü&lt;br /&gt;araçlardan birini alıyor. Tarih ve ekonomi gibi bilimlerde "deney"&lt;br /&gt;yapmanın zorluğundan hep bahsedilir, ama belki esas problem deney&lt;br /&gt;yapabilsek bile bunların benzer sonuçlar vermeyecek oluşu. Bu&lt;br /&gt;olaylara belli simetrilerin olduğu seviyelerde bakmadığımız sürece&lt;br /&gt;durum ümitsiz gibi görünüyor.&lt;br /&gt;&lt;br /&gt;4. Neyin bilimsel olarak açıklanabilir neyin açıklanamaz olduğu&lt;br /&gt;konusunda önceki yazdılarımda "algorithmic complexity"'ye gönderme&lt;br /&gt;yaptım. Bir dizi (olaylar, sayılar, bitler dizisi), ya bir miktar&lt;br /&gt;açıklanabilir (compress edilebilir, predict edilebilir), ya da&lt;br /&gt;tamamiyle random'dur (random'luğun tanımını böyle yapıyor&lt;br /&gt;teori). Ama bu potansiyel olarak uzayabilecek tartışmayı sonraya&lt;br /&gt;bırakıyorum.&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-4869325882277743167?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='related' href='http://tech.groups.yahoo.com/group/ariteknokent/message/1020' title='Simetri üzerine'/><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/4869325882277743167/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=4869325882277743167' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/4869325882277743167'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/4869325882277743167'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2007/10/simetri-uzerine.html' title='Simetri üzerine'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-9145830611512312263</id><published>2007-10-21T20:42:00.002+03:00</published><updated>2010-11-03T09:08:54.409+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Türkçe'/><title type='text'>Düşünce deneylerine iyi bir örnek</title><content type='html'>Üniversitede fizik hocamdan duyduğum bir düşünce deneyi hikayesi&lt;br /&gt;geldi aklıma. Düşünce deneylerinin iyi düşünüldüğü zaman ne kadar&lt;br /&gt;etkili olabileceğine bir örnek: &lt;span class="fullpost"&gt;&lt;br /&gt;&lt;br /&gt;Aristo'nun fiziğinde düz giden cisimler ittirilmezlerse bir süre&lt;br /&gt;sonra dururlar. İki cisim bırakıldığında ağır olan önce&lt;br /&gt;düşer. Bunlar günlük deneyimimizle tutarlı gibi görünen&lt;br /&gt;sonuçlar. Ayrıntılı bir deney yapılmadığında bir çocuğa da&lt;br /&gt;sorsanız bu cevapları alırsınız basit düşünce deneylerinden. Bugün&lt;br /&gt;Aristo'yu suçluyoruz, adam niye bir kuleye çıkıp da denememiş&lt;br /&gt;diye. Ama suç gerçekten düşünce deneyinde mi?&lt;br /&gt;&lt;br /&gt;Dikkatle tasarlanmış bir düşünce deneyi bize gerçeği gösterebilir.&lt;br /&gt;Örneğin Pisa kulesinin tepesinden bir tuğla bıraktığımızda&lt;br /&gt;diyelim yere düşmesi 10 saniye sürsün. Birincisine eş ikinci bir&lt;br /&gt;tuğla bıraktığımızda onun da 10 saniyede düşeceğine herhalde&lt;br /&gt;Aristo'nun da bir şüphesi yoktu. Peki iki tuğlayı aynı anda&lt;br /&gt;bıraksak? Birbirlerini etkilediklerini düşünmediğimize göre yine&lt;br /&gt;birlikte on saniye sonra yere çarpacaklar. Şimdi çarpıcı noktaya&lt;br /&gt;geldik: iki eş tuğlanın arasına bir damla japon yapıştırıcısı&lt;br /&gt;damlatıp bunları iki kat büyüklükte tek bir kütle haline&lt;br /&gt;getirelim. Bu yeni kütleyi bıraktığımızda kaç saniyede düşmesini&lt;br /&gt;bekliyoruz? Bunun bir önceki deneyden gerçekten bir farkı var mı?&lt;br /&gt;Birinde iki tuğla yanyana düşüyor, diğerinde yine yanyana&lt;br /&gt;birbirlerine yapışık düşüyorlar.  Sanırım Aristo vaktiyle bu&lt;br /&gt;deneyi düşünse Galileo'dan 2000 yıl önce doğru çözümü gorebilirdi.&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-9145830611512312263?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='related' href='http://tech.groups.yahoo.com/group/ariteknokent/message/1019' title='Düşünce deneylerine iyi bir örnek'/><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/9145830611512312263/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=9145830611512312263' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/9145830611512312263'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/9145830611512312263'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2007/10/dusunce-deneylerine-iyi-bir-ornek.html' title='Düşünce deneylerine iyi bir örnek'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-1988085327665709433</id><published>2007-10-21T20:25:00.002+03:00</published><updated>2010-11-03T09:08:54.410+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Türkçe'/><title type='text'>Bilim budalalığı</title><content type='html'>Bilim budalalığı konusunda geçenlerde seyrettiğim Adam Curtis'in&lt;br /&gt;&lt;a href="http://www.youtube.com/watch?v=Bf0ORy4BzMY"&gt;Pandora's Box&lt;/a&gt; belgeselini tavsiye ederim.&lt;br /&gt;&lt;br /&gt;Altı bölümlük bu belgeselde yapımcı, komünist Rusya'nın rasyonel&lt;br /&gt;ve mekanik bir toplum kurma hayalinden, atomik enerji konusunda&lt;br /&gt;uğradığımız fiyaskolara, çevre sorunlarından, ekonomistlerin&lt;br /&gt;skandallarına kadar uzanan geniş bir yelpazede bilimin bize&lt;br /&gt;yaşattığı hayal kırıklıklarını incelemiş. &lt;span class="fullpost"&gt;&lt;br /&gt;&lt;br /&gt;Anlattığı altı hikayenin hepsinde benim görebildiğim senaryo&lt;br /&gt;benzer: belki geliştirilip olgunlaştırılması için on yıllar ya da&lt;br /&gt;yüz yıllar gerekebilecek teknolojiler bilim adamları tarafından&lt;br /&gt;kişisel sebeplerle over-sell ediliyor. Politikacılar gördükleri&lt;br /&gt;prototiplere güvenip halka tutamayacakları sözler&lt;br /&gt;veriyorlar. Sonuçta büyük bir zaman baskısı altında olgunlaşmamış&lt;br /&gt;teknolojiler üretime geçiyor.  Riskler hakkında yalanlar&lt;br /&gt;söyleniyor, doğruyu söylemeye çalışan dürüst bilim adamları&lt;br /&gt;susturuluyor. Sonunda beklenen felaket gerçekleştiğinde insanlar&lt;br /&gt;bilimi suçluyorlar.&lt;br /&gt;&lt;br /&gt;Bu senaryonun gösterdiği problemin çözümünü bilimde değil&lt;br /&gt;sosyoloji ve politikada aramak gerek diye düşünüyorum. Bilim&lt;br /&gt;sonuçta sadece "bilmek" kökünden gelen bir kelime, bilmenin de&lt;br /&gt;kimseye bir zararı yok (eğer Popper'ın açık toplumuna, Mill'in&lt;br /&gt;düşünce özgürlüğüne inanıyorsanız). Tüm felaketler bilmediğimiz&lt;br /&gt;konularda biliyormuş gibi davranmaktan ya da bildiğimiz bazı&lt;br /&gt;şeyleri saklamaktan geliyor.&lt;br /&gt;&lt;br /&gt;&lt;a href="http://www.radikal.com.tr/haber.php?haberno=235240"&gt;Nuray Mert'in bilimle ilgili yazısını&lt;/a&gt; ise neresinden tutayım&lt;br /&gt;bilemiyorum. Memduh sanki sırf benim tüylerimi diken diken etmek&lt;br /&gt;amacıyla göndermiş.  Sanki çok bilime gark olmuş bir ülkeymişiz&lt;br /&gt;gibi... Aşağıda bir iki kavram karmaşası ve factual error&lt;br /&gt;konusunda yorum yapacağım sadece:&lt;br /&gt;&lt;br /&gt;1. "bilimin nefesinin tükendiği konular": burada bugünkü bilimin&lt;br /&gt;açıklayamadığı şeyler mi kasıt, yoksa prensipte açıklanamayacak&lt;br /&gt;şeyler mi emin değilim. Her halükarda yazının genelinden bilimsel&lt;br /&gt;yöntemle, günümüzün bilimsel bilgilerinin birbiriyle&lt;br /&gt;karıştırıldığı izlenimindeyim. Bugün bildiklerimizin yanlışlığının&lt;br /&gt;her an ispatlanabileceği bilimin tanımında var zaten. Ama&lt;br /&gt;yapabildiğimizin en iyisi bu, daha iyisini bilen varsa hodri&lt;br /&gt;meydan.&lt;br /&gt;&lt;br /&gt;2. "bu gayret bugüne kadar varoluşa dair temel soruları çözmekte&lt;br /&gt;işe yaramadı" ilginç bir gözlem. Sonuçta modern insan 150,000&lt;br /&gt;yıldır bu dünyada dolaşıyor. Üç yüz yıllık bir çaba sonucu evrenin&lt;br /&gt;en uzak noktalarıyla, maddenin en küçük yapıtaşlarıyla ilgili&lt;br /&gt;fikirler üretiyor, cansız moleküllere can verme noktasına&lt;br /&gt;geliyoruz. Nuray hanımı tatmin etmek için hangi "temel sorulara"&lt;br /&gt;cevap vermemiz gerekecek acaba?&lt;br /&gt;&lt;br /&gt;3. "Aydınlanma felsefesi, bilimsel düşünceyi temel alan dünya&lt;br /&gt;görüşünün, toplumsal siyasal meseleleri de halledeceğini iddia&lt;br /&gt;ediyor, umuyordu. Öyle olmadı."  Bu, eğer cümlenin öznesini yanlış&lt;br /&gt;okumadıysam "aydınlanma felsefesi" nin bir problemi, bilimin değil&lt;br /&gt;:) Tabi iki paragraf sonra yazarımız bilime karşı felsefeyi&lt;br /&gt;savunarak benim kafamı temelli karıştırıyor: "bu sorular hâlâ&lt;br /&gt;felsefi düzeyde tartılması gereken konularmış."&lt;br /&gt;&lt;br /&gt;Toplumsala bilimin uygulanması konusu yukarıda bahsi geçen&lt;br /&gt;belgeselin ilk bölümünde Rusya örneğiyle ele alınıyor. Bilim şu&lt;br /&gt;anda gerek insan psikolojisini gerek toplum davranışını başarılı&lt;br /&gt;bir şekilde açıklamaktan çok uzak. Bu gerçeği çarpıtıp&lt;br /&gt;bilmediğimiz konularda birşeyler biliyormuşuz gibi davranmak ancak&lt;br /&gt;insanların zayıflığı ve irrasyonelliğinin sonucu, fazla&lt;br /&gt;rasyonelliğin değil.&lt;br /&gt;&lt;br /&gt;4. "İnsanlar, ortaçağda da birbirlerinin gözünü oyuyordu, bugün de&lt;br /&gt;bu noktadan uzaklaşmış falan değiller."  Bu noktada Steven&lt;br /&gt;Pinker'in vahşetin tarihsel düşüşü ile ilgili &lt;a href="http://www.ted.com/index.php/talks/view/id/163"&gt;TED konuşmasını&lt;/a&gt;&lt;br /&gt;tavsiye ediyorum. Burada Nuray hanımın söyledikleri tamamiyle&lt;br /&gt;yanıltıcı. 20.  yüzyılın dünya savaşlarını bile düşündüğümüzde&lt;br /&gt;eğer ölüm oranı eski dünyanın kabile savaşları ile aynı olsaydı&lt;br /&gt;100 milyon değil 2 milyar insanın ölmesini beklerdik. Tanıdığınız&lt;br /&gt;erkeklerin yarısının 30 yaşını fazla geçmeden başka bir erkeğin&lt;br /&gt;eliyle ölmesi normdu birkaç yüzyıl öncesine kadar.&lt;br /&gt;&lt;br /&gt;5. "Okullarda bilimsel bilgi öğretilmesin diyen yok, ama bilimsel&lt;br /&gt;bilgi mutlak gerçek yerine konulmasın deme hakkımız yok mu?"&lt;br /&gt;Bilimsel bilginin mutlak gerçek olduğunu iddia eden var mı acaba?&lt;br /&gt;Bunun bilimin tanımına aykırı olduğunu açıklayarak okuyucularımın&lt;br /&gt;zamanını almayacağım.&lt;br /&gt;&lt;br /&gt;Belki de beni en rahatsız eden yazarın alternatif sunmayarak olan&lt;br /&gt;gelişmeleri tek taraflı eleştirmesi. Onu değil de şunu yapsaydık&lt;br /&gt;demiyor, geleceğe yönelik bir tavsiyede bulunmuyor. Hayata dair&lt;br /&gt;bilgilenme çeşitlerine başka başarılı örnekler sunmuyor.&lt;br /&gt;İnsanların ve kurumların yapılarından kaynaklanan bir takım&lt;br /&gt;sorunları bilimin suçuymuş gibi sunup her türlü alternatif&lt;br /&gt;crackpot felsefeye yeşil ışık yakıyor. Tam da günümüzde&lt;br /&gt;ihtiyacımız olan bakış açısı (!).&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-1988085327665709433?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='related' href='http://tech.groups.yahoo.com/group/ariteknokent/message/1018' title='Bilim budalalığı'/><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/1988085327665709433/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=1988085327665709433' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/1988085327665709433'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/1988085327665709433'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2007/10/bilim-budalalg.html' title='Bilim budalalığı'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-2706043937528288994</id><published>2007-10-21T20:22:00.001+03:00</published><updated>2010-11-03T09:08:54.410+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Türkçe'/><title type='text'>Nedenciler ve Nasılcılar</title><content type='html'>Nedenci nasılcı ayrımı konusunda bir süredir kafama takılan bir&lt;br /&gt;soru var. Memduh'un deyimiyle:&lt;br /&gt;&lt;br /&gt;&gt; Ama galiba bütün bu sorgulamaların sonunda cevap ya&lt;br /&gt;&gt; (i) fiziksel bir sebebe bağlanacak, veya&lt;br /&gt;&gt; (ii) (bireyin. evrimin, Tanrının) aklıyla, isteğiyle olacak&lt;br /&gt;&lt;br /&gt;Burada fiziksel bir sebepten kasıt eminim bugünkü bilimsel&lt;br /&gt;bilgimizle sınırlı bir açıklama değil. Örneğin bugün evrende dört&lt;br /&gt;ayrı kuvvet olduğunu düşünüyor tüm gözlemlerimizi bu dört kuvvetin&lt;br /&gt;bir sonucu olarak açıklamaya çalışıyoruz. Ama yarın bu dört&lt;br /&gt;kuvvetin açıklayamadığı bir deney yapılsa bu bilimin sonu olmaz,&lt;br /&gt;ya beşinci bir kuvvet eklenir, ya da kuvvetlerin ötesinde bambaşka&lt;br /&gt;bir kavram çatısı bulunur. &lt;span class="fullpost"&gt;&lt;br /&gt;&lt;br /&gt;Bilimi bu geniş anlamıyla aldığımızda bir olgunun (örneğin bilinç&lt;br /&gt;gibi) bilimsel yöntemle açıklanması ne demek önce ona bakalım:&lt;br /&gt;Başarılı bir bilimsel açıklama bize o güne kadar yapılmış&lt;br /&gt;gözlemlerle tutarlı ve gelecekte yapabileceğimiz gözlemlere&lt;br /&gt;yönelik test edilebilir tahminlerde bulunabileceğimiz bir hipotez,&lt;br /&gt;bir teori sunuyor. Dolayısıyla örneğin başarılı bir psikolojik&lt;br /&gt;teori bulduğumuz zaman (şu an böyle bir teori henüz yok) bu benim&lt;br /&gt;hangi koşullarda neyi isteyeceğimi, nelere inanacağımı, nasıl&lt;br /&gt;davranacağımı prensipte başarılı şekilde tahmin etmemizi&lt;br /&gt;sağlayabilir. Bu teoriyi kullanarak benzer özelliklere sahip&lt;br /&gt;robotlar, simülasyonlar geliştirebiliriz vs.&lt;br /&gt;&lt;br /&gt;Bu noktada ilk sorum şu: nedencilerin açıklama olarak&lt;br /&gt;kastettikleri şey tam olarak nedir? Örneğin tanrı istedi de öyle&lt;br /&gt;oldu, ya da ben istedim de öyle yaptım gibi ifadeler birer&lt;br /&gt;açıklama mı? Eğer öyle ise bu iki grup insanın "açıklama" kavramı&lt;br /&gt;ile kastettikleri çok ayrı şeyler. Dolayısıyla bu yanlış bir&lt;br /&gt;dikotomi, iki grup aynı sorulara cevap aramıyor. Nedencilerin&lt;br /&gt;açıklamalarında gelecekle ilgili tahminlerde bulunabilme isteği ya&lt;br /&gt;da potansiyeli yok. Zaten böyle tahminlerde bulunabilsek (örneğin&lt;br /&gt;bireylerin ya da Tanrı'nın yarın ne isteyeceği üzerine başarılı&lt;br /&gt;bir teorimiz olsa), o zaman nasılcılardan bir farkımız kalmazdı.&lt;br /&gt;&lt;br /&gt;Bir ihtimal nedenciler bazı şeylerin prensipte bilimsel yöntemle&lt;br /&gt;açıklanamayacağına inanıyorlar. Peki o zaman bir olgunun bilimsel&lt;br /&gt;yöntemle prensipte açıklanamaması ne demek? Örneğin Penrose&lt;br /&gt;bilinci açıklamak için yeni kuantum özellikler ararken bugünkü&lt;br /&gt;bilimin yetersiz olduğuna mı inanıyor, yoksa bilimin genel olarak&lt;br /&gt;bilinci açıklayamayacağına mı? Bilimsel açıklama ve tahminlere bir&lt;br /&gt;bakış açısı bugüne kadar gözlenmiş bir sayı dizisinin bundan&lt;br /&gt;sonraki terimlerini tahmin etmek olarak özetlenebilir. Bu durumda&lt;br /&gt;bu tahminin prensipte yapılamayacağını iddia etmek bu dizinin&lt;br /&gt;tamamiyle rastgele (random) olduğunu iddia etmeye denk. Bu durumda&lt;br /&gt;nedenciler Tanrı'nın ya da insan bilincinin bir seviyede püre&lt;br /&gt;randomness'dan geldiğine mi inanıyorlar?&lt;br /&gt;&lt;br /&gt;Diğer bir ihtimal, nedencilerin bilimsel açıklamalarla bir alıp&lt;br /&gt;veremedikleri yok, sadece bu tip açıklamalar onları&lt;br /&gt;ilgilendirmiyor, ya da onları tatmin etmiyor. Buna söyleyecek&lt;br /&gt;birşey yok tabi, ama o zaman ortada kavramsal olarak önemli bir&lt;br /&gt;dikotomi yerine kişisel tercihler var dememiz lazim...&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-2706043937528288994?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='related' href='http://tech.groups.yahoo.com/group/ariteknokent/message/1017' title='Nedenciler ve Nasılcılar'/><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/2706043937528288994/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=2706043937528288994' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/2706043937528288994'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/2706043937528288994'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2007/10/nedenciler-ve-naslclar.html' title='Nedenciler ve Nasılcılar'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-2174120052413816359</id><published>2007-10-01T12:54:00.007+03:00</published><updated>2010-11-03T08:14:40.095+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Students'/><title type='text'>Mehmet Ali Yatbaz, M.S. 2007</title><content type='html'>&lt;iframe src='http://docs.google.com/EmbedSlideshow?docid=d2jm3f3_925fpf82ncb' frameborder='0' width='410' height='342'&gt;&lt;/iframe&gt;&lt;br /&gt;&lt;span class="fullpost"&gt;&lt;br /&gt;&lt;b&gt;Stretch: A Feature Weighting Method for The k Nearest Neighbor Algorithms&lt;/b&gt;&lt;br /&gt;Mehmet Ali Yatbaz.  M.S. Thesis, Koç University Department of Computer Engineering, October 2007. &lt;br /&gt;&lt;br /&gt;Abstract: The k nearest neighbor learning algorithm (kNN) is one of the well studied nonparametric learning algorithms. kNN assumes that the underlying joint probability density function of the training set is unknown and it estimates the underlying joint probability density functions using the labeled data set (training set). Although this is a realistic assumption in terms of the real world problems, it introduces some limitations on the predictive&lt;br /&gt;accuracy, the storage complexity and computational compexity of the kNN.&lt;br /&gt;&lt;br /&gt;The goal of this thesis is to understand kNN and techniques that are used to increase the predictive accuracy of kNN. This thesis mainly focuses on the effect of the irrelevant features on the predictive accuracy of the kNN and introduces the Stretch method, a new preprocessing method to increase the predictive accuracy of kNN by doing linear transformation on the training data matrix. The method incrementally constructs a linear transformation that maximizes the nearest neighbor classification accuracy on the training set. At each iteration the method picks an instance from the data set, and computes a transformation that moves the instance closer to the instances with the same category and/or away from the instances in other categories.  The composition of these iterative linear transformations can lead to statistically significant improvements in kNN learning algorithms.&lt;br /&gt;&lt;br /&gt;&lt;a href="http://www.denizyuret.com/students/myatbaz/Thesis.pdf"&gt;Download PDF&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-2174120052413816359?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='related' href='http://www.denizyuret.com/students/myatbaz/Thesis.pdf' title='Mehmet Ali Yatbaz, M.S. 2007'/><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/2174120052413816359/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=2174120052413816359' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/2174120052413816359'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/2174120052413816359'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2007/10/mehmet-ali-yatbaz-ms-2007.html' title='Mehmet Ali Yatbaz, M.S. 2007'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-3283081241740997973</id><published>2007-09-26T15:29:00.000+03:00</published><updated>2010-11-03T09:08:54.411+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Türkçe'/><title type='text'>Düşünce deneyleri</title><content type='html'>Bu aralar bilinç üzerine ne varsa okuyorum. Tartışmaların çoğu&lt;br /&gt;düşünce deneyleri üzerine dönüyor. Birkaç örnek vermek gerekirse: &lt;span class="fullpost"&gt;&lt;br /&gt;&lt;br /&gt;* Eğer bir yarasa olsak kendimizi nasıl hissederdik?&lt;br /&gt;&lt;br /&gt;* Star trek usulü bir scanner ile tüm atomlarınızı tarayıp sizi&lt;br /&gt;başka bir yerde yeniden oluştursak hangi sız gerçek siz olurdunuz?&lt;br /&gt;&lt;br /&gt;* Benimle aynı fiziksel konfigürasyona sahip olan (atomlar,&lt;br /&gt;moleküller) ama içinde bir "ben" olmayan bir yaratık (zombie)&lt;br /&gt;olabilir mi?&lt;br /&gt;&lt;br /&gt;* Gelecekte beynin tüm sırlarını çözmüş olalım. Mary, bir renk&lt;br /&gt;uzmanı, insanın renk algılarıyla ilgili bütün mekanizmaları en&lt;br /&gt;ince ayrıntısına kadar biliyor olsun. Ama Mary ömrü boyunca siyah&lt;br /&gt;beyaz bir odada oturup dünyayı siyah beyaz bir monitörden&lt;br /&gt;izlesin. Sonunda bir gün dışarı çıkıp gerçek renkleri gördüğünde&lt;br /&gt;yeni birşey öğrenir mi?&lt;br /&gt;&lt;br /&gt;Beni bu konuda düşünmeye iten küçük bir deneyi blog'uma koydum:&lt;br /&gt;&lt;br /&gt;&lt;a href="http://denizyuret.blogspot.com/2007/09/hearing.html"&gt;http://denizyuret.blogspot.com/2007/09/hearing.html&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;Düşünce deneylerinin kötü tarafı fiziksel deneyler gibi&lt;br /&gt;kendilerini tutarlı bir gerçeğe bağlayan güçlü bir bağ&lt;br /&gt;olmaması. (Fiziksel gerçeğin neden tutarlı olduğu ayrı bir&lt;br /&gt;tartışma konusu - elmanın dünyaya düşmesini sağlayan mekanizma ile&lt;br /&gt;ayı dünyanın etrafında döndüren mekanizmanın aynı kurallarla&lt;br /&gt;açıklanması, bu yüzyılda yaşamasak hayli şaşırtıcı olurdu.) O&lt;br /&gt;zaman düşünce deneylerinin gördüğü fonksiyon birer sezgi pompası&lt;br /&gt;(intuition pump) rolü oynayıp kafamızda zaten var olan bir takım&lt;br /&gt;sezgileri daha ön plana çıkararak eldeki probleme uygulamamızı&lt;br /&gt;sağlamak. Tabi başlangıçtaki sezgilerimiz yanlışsa vardığımız&lt;br /&gt;sonuçlar da yanlış oluyor.&lt;br /&gt;&lt;br /&gt;Tabi tüm düşünce deneyleri yukarıdakiler gibi yanıltıcı değil.&lt;br /&gt;Einstein usulü fiziksel "gedankenexperiment" lar fiziksel olarak&lt;br /&gt;yapılması mümkün olmayan (en azından o gün için) deneylerle&lt;br /&gt;insanların düşüncelerini organize etmeye yarayabiliyorlar.&lt;br /&gt;&lt;br /&gt;Yine de dikkatli olmak lazım. Sadece bazı filozofların zombie'ler&lt;br /&gt;olduğunu hayal edebiliyor olması zombie'lerin gerçekten&lt;br /&gt;olabileceğini ya da bilincin fizik dışı bir komponenti olduğunu&lt;br /&gt;ispatlamıyor. Ya da bazı filozofların "hisseden robot"&lt;br /&gt;yapılabileceğini hayal edemiyor olması da onların hayal gücünün&lt;br /&gt;bir problemi, herhangi bir gerçeğin ispatı değil.&lt;br /&gt;&lt;br /&gt;"Bir gün bir adaya düştüğünde..." diye başlayan düşünce&lt;br /&gt;deneylerinde de problem hayal gücümüzün sınırları ile bağımlı&lt;br /&gt;olmaları. Bir adaya düştüğümüzde gerçekten olabilecek pek çok şeyi&lt;br /&gt;şu anki sınırlı deneyimimizle hayal edemiyor olabiliriz. Fiziksel&lt;br /&gt;deneylerin güzelliği deneyi yapanın hayal gücüne bağımlı&lt;br /&gt;olmamaları ve bazan hayal etmeyi aklımızdan geçirmediğimiz&lt;br /&gt;sonuçlar verebilmeleri.&lt;br /&gt;&lt;br /&gt;Belki bundan da önemlisi Galile'nin İtalya da 1650'de yaptığı&lt;br /&gt;deneyin şu an İstanbul'da 2007'de (ya da Andromeda Galaksisinde,&lt;br /&gt;bundan bir milyar yıl sonra) da aynı sonucu vermesi. Bunu biliyor&lt;br /&gt;muyuz, inanıyor muyuz, iman mı ediyoruz orası tartisilir.&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-3283081241740997973?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='related' href='http://tech.groups.yahoo.com/group/ariteknokent/message/999' title='Düşünce deneyleri'/><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/3283081241740997973/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=3283081241740997973' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/3283081241740997973'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/3283081241740997973'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2007/09/dusunce-deneyleri.html' title='Düşünce deneyleri'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-8674312048788260299</id><published>2007-09-23T22:27:00.004+03:00</published><updated>2010-11-03T09:08:54.412+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Notes'/><title type='text'>Hearing</title><content type='html'>In his book "Auditory Scene Analysis", author Albert S. Bregman likens the ear canals to two narrow channels on the edge of a lake and sound waves to water waves:&lt;br /&gt;&lt;br /&gt;"Your friend digs two narrow channels up from the side of the lake.  Each is a few feet long, and a few inches wide, and they are spaced a few feet apart.  Halfway up each one, your friend stretches a handkerchief and fastens it to the sides of the channel.  As waves reach the side of the lake they travel up the channels and cause the two handkerchiefs to go into motion.  You are allowed to look only at the handkerchiefs and from their motions to answer a series of questions: How many boats are there on the lake, and where are they?  Which is the most powerful one?  Which one is closer?  Is the wind blowing?  Has any large object been dropped suddenly into the lake?"&lt;br /&gt;&lt;br /&gt;As impossible as this task sounds, it is analogous to the work performed by your auditory system.&lt;br /&gt;&lt;br /&gt;Here is a small experiment.  Listen to this first recording and try to guess what it is: &lt;span class="fullpost"&gt;&lt;br /&gt;&lt;br /&gt;&lt;embed src="http://ia341233.us.archive.org/2/items/doors/creep_64kb.mp3" autostart="false" loop="FALSE" height="40" width="140"&gt;&lt;/embed&gt;&lt;br /&gt;&lt;br /&gt;No, it is not some wild animal or an alien.  It is just human speech  (singing) and some instruments, slowed down.  Here is the original, if you are curious:&lt;br /&gt;&lt;br /&gt;&lt;embed src="http://ia341233.us.archive.org/2/items/doors/doors_64kb.mp3" autostart="false" loop="FALSE" height="40" width="140"&gt;&lt;/embed&gt;&lt;br /&gt;&lt;br /&gt;Now try to listen to the first recording again and see if you can figure out what the words are and where they start and end.  People in general do not have a good appreciation of how difficult the problems of perception are (unless they are trying to build a machine to solve these problems).  We have been working on speech recognition for decades but the best programs still do not perform very well except in very restricted contexts.  Yet we recognize speech so effortlessly that it is difficult to see what the big deal is.  Trying to recognize the words in the first recording may help you appreciate the computer's difficulty.&lt;br /&gt;&lt;br /&gt;The thing that struck me about the first recording when I first heard it was how different it "felt" than human speech.  Although it contained exactly the same information as the original its "quale" was different in philosopher-speak.  This reminded me of a famous thought experiment devised by philosopher Frank Jackson: Mary the color scientist.&lt;br /&gt;&lt;br /&gt;Mary lives in the far future when neuroscience is complete and scientists know everything there is to know about the physical processes in the brain.  She has studied and learned everything there is to know about color perception: the optics of the eye, the properties of colored objects, the processing of color information in the visual system, and how this information leads to actions, memories, feelings etc.  But Mary has been brought up all her life in a black and white room, she has never seen any colors at all.  One day Mary is let out of her black and white room and sees colors for the first time.  What happens?  Does she learn anything new?&lt;br /&gt;&lt;br /&gt;Frank Jackson argues that she obviously learns something fundamentally new: what red is like, its raw feel, its quale.&lt;br /&gt;&lt;br /&gt;When I listen to the first recording I think of an alien speech scientist, trying to decipher the message hidden in the signal.  The alien can train itself and become an expert at recognizing the words upon hearing the signal.  But will it ever get the same "quale" as we get when we listen to the original recording?&lt;br /&gt;&lt;br /&gt;Philosopher Dan Dennett argues that there are no such things as qualia and Mary will not learn anything new when she sees the colors for the first time.  That is, of course, if we take the premises seriously: that she knows EVERYTHING there is to know about color perception.  As counterintuitive as this sounds, I find that when the subject is the "mind", familiar and intuitive is usually wrong.&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-8674312048788260299?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/8674312048788260299/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=8674312048788260299' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/8674312048788260299'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/8674312048788260299'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2007/09/hearing.html' title='Hearing'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-3319244483401289143</id><published>2007-07-25T14:59:00.002+03:00</published><updated>2010-11-03T09:08:54.413+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Türkçe'/><title type='text'>Tanrının işine karışmak</title><content type='html'>Nicholas D. Kristof taşıyıcı annelik ve genetik manipülasyonlar&lt;br /&gt;konusunda yazdığı yazısını şöyle bitiriyor:&lt;br /&gt;&lt;br /&gt;"What should cross the line into illegality is fiddling with the&lt;br /&gt;heritable DNA of humans to make them smarter, faster or more pious&lt;br /&gt;or more deaf. That is playing God not just with a particular embryo&lt;br /&gt;but with our species, and we should ban it." &lt;br /&gt;&lt;br /&gt;"Playing God" deyimi bu tip tartışmalarda çok geçmeye başladı. Ne anlama geldiği belirsiz, içi boş, ama negatif duygular uyandıran ve dolayısıyla her türlü önyargıyı içinde barındırmaya açık bir deyim.  Temiz argümanlar kullanmanın önemli olduğunu düşünüyorum ve Kristof'un yazısının bu testi geçtiğine çok emin değilim. &lt;span class="fullpost"&gt;&lt;br /&gt;&lt;br /&gt;Bana Craig Venter'e son projesi için izin verip vermemeye karar veren&lt;br /&gt;grubun içinde bütün büyük dinlerden temsilcilerin olmasını&lt;br /&gt;hatırlatıyor. Temsilciler kendi inandıkları kitapları yorumlayıp&lt;br /&gt;"cansızdan canlı yaratma" konusunda bir yasak olup olmadığına karar&lt;br /&gt;vermeye çalıştılar. Din konusundaki görüşünüz ne olursa olsun, umarım&lt;br /&gt;bu tip konularda karar verirken ne gibi kriterler kullanacağımızın&lt;br /&gt;hepimizin üzerinde anlaşabileceğimiz şekilde daha net belirlenmesi&lt;br /&gt;gerektiği konusunda bana katılıyorsunuzdur.&lt;br /&gt;&lt;br /&gt;Bu kriterlerin ne olması gerektiği konusunda anlaşabilir miyiz?&lt;br /&gt;Ekonomik kriterlerin daha dikkatli analizi çözümün bir parçası&lt;br /&gt;olabilir. Dennett ve Drescher gibi filozofların üzerinde uğraştığı&lt;br /&gt;"Getting ought from is", yani temel gerçeklerden başlayıp etik&lt;br /&gt;prensiplere akıl yoluyla ulaşma projesi belki bir diğer parçası.&lt;br /&gt;&lt;br /&gt;Problemin zor tarafı hedefin ne olduğunu belirlemek gibi geliyor bana.&lt;br /&gt;Hedef tam olarak belirlendiğinde bunun uygulamaya nasıl döküleceği&lt;br /&gt;bir sosyal mühendislik problemi. İnsanların neslinin sürdürülmesi,&lt;br /&gt;refah içinde yaşanması, sağlık, mutluluk, nedir tam olarak amacımız.&lt;br /&gt;Önümüzdeki yüzyılda transhümanistlerin rüyaları gerçek olmaya başlarsa&lt;br /&gt;iki bin yıldır çok değişmeyen bazı temel hedefleri sorgulamamız&lt;br /&gt;gerekecek mi?&lt;br /&gt;&lt;br /&gt;Net bir hedef üzerinde anlaştığımızda ve bu hedefe bizi götürecek&lt;br /&gt;mühendislik çözümler bulunduğunda "playıng God" gibi içi boş&lt;br /&gt;terimlerle birbirimizi oyalamayı bırakacağız diye umuyorum.&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-3319244483401289143?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='related' href='http://tech.groups.yahoo.com/group/ariteknokent/message/994' title='Tanrının işine karışmak'/><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/3319244483401289143/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=3319244483401289143' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/3319244483401289143'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/3319244483401289143'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2007/07/tanrnn-isine-karsmak.html' title='Tanrının işine karışmak'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-5225236961314927223</id><published>2007-06-28T16:31:00.005+03:00</published><updated>2009-02-14T04:53:43.906+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Publications'/><title type='text'>The CoNLL 2007 Shared Task on Dependency Parsing</title><content type='html'>Joakim Nivre, Johan Hall, Sandra Kübler, Ryan McDonald, Jens Nilsson, Sebastian Riedel and Deniz Yuret.  In &lt;i&gt;Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL)&lt;/i&gt; &lt;span class="fullpost"&gt;&lt;br /&gt;&lt;br /&gt;Abstract: The Conference on Computational Natural Language Learning features a shared task, in which participants train and test their learning systems on the same data sets. In 2007, as in 2006, the shared task has been devoted to dependency parsing, this year with both a multilingual track and a domain adaptation track. In this paper, we define the tasks of the different tracks and describe how the data sets were created from existing treebanks for ten languages. In addition, we characterize the different approaches of the participating systems, report the test results, and provide&lt;br /&gt;a first analysis of these results.&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-5225236961314927223?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='related' href='http://www.aclweb.org/anthology-new/D/D07/D07-1096.pdf' title='The CoNLL 2007 Shared Task on Dependency Parsing'/><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/5225236961314927223/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=5225236961314927223' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/5225236961314927223'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/5225236961314927223'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2007/06/conll-2007-shared-task-on-dependency.html' title='The CoNLL 2007 Shared Task on Dependency Parsing'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-6346052213157664760</id><published>2007-06-26T15:23:00.001+03:00</published><updated>2010-11-03T09:08:54.414+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Türkçe'/><title type='text'>Evrenin israfkarlığı</title><content type='html'>İnsan genetiği belirlendiginde ilginç bir gerçek ortaya çıktı:&lt;br /&gt;Proteinleri tanımlayıp özelliklerimizi belirleyen genler DNA'nın&lt;br /&gt;sadece yüzde üçünü kaplıyorlar.  Bu biraz israf gibi gelmiyor mu&lt;br /&gt;insana? &lt;span class="fullpost"&gt;&lt;br /&gt;&lt;br /&gt;1. Micro-RNA'leri kodlayan DNA bölümleri de var, fakat&lt;br /&gt;micro-RNA'lerin sayısı protein kodlayan genlerin sayısı ile aynı&lt;br /&gt;büyüklükte gibi. Micro-RNA'lerin boyları da daha küçük olduğundan&lt;br /&gt;3% hadi çıkmış olsun 10%'a. Hala bir 90% var sanki "junk" olarak.&lt;br /&gt;&lt;br /&gt;2. Tabi bu bölge başka şeyler kodlayabilir, RNA kodlayan yerlerin&lt;br /&gt;birden fazla kopyası olabilir vs. Fakat DNA'de hiç birşey&lt;br /&gt;kodlamadığı belli olan uzun tekrarlayan diziler olduğunu biliyorum&lt;br /&gt;(ATATATATATA) gibi. İşten anlayanlarımız bunların DNA'nın ne&lt;br /&gt;kadarını kapsadığını belki bize bulabilir.&lt;br /&gt;&lt;br /&gt;3. İşin garibi bazı bakteriler bu konuda bizden çok daha verimli&lt;br /&gt;kodlama yapıyor, DNA'larının neredeyse 100%'unu kullanıyorlar&lt;br /&gt;(yanlış hatırlıyorsam düzeltin). 3% rakamı insanlara ve diğer&lt;br /&gt;gelişmiş hayvanlara özel.&lt;br /&gt;&lt;br /&gt;4. Konuyla alakası yok ama bazı basit bitkilerin insanlardan çok&lt;br /&gt;daha fazla gene sahip olması da ilginç değil mi?&lt;br /&gt;&lt;br /&gt;5. Peki insan vücudunun 60% su olmasına ne demeli.&lt;br /&gt;&lt;br /&gt;6. Ya da atomların içinin 99.9999999% boş olmasına? Katı madde&lt;br /&gt;dediğimiz şeylerin büyük oranda boş olması 20. yüzyıl fiziğinin en&lt;br /&gt;ilginç bulgularından biri. Eğer çekirdek bir sinek ise,&lt;br /&gt;elektronlar onun etrafında futbol sahası boyunda yörüngelerde&lt;br /&gt;dolaşıyorlar.&lt;br /&gt;&lt;br /&gt;7. Son olarak eğer çok-evrenli kuantum yorumları doğru ise evren&lt;br /&gt;her an neredeyse sonsuz dala ayrılmakta, bu dalların her biri&lt;br /&gt;diğerleri kadar gerçek, biz ise sadece bunlardan birini&lt;br /&gt;algılayabiliyoruz (tabi diğer kopyalarımız diğerlerini&lt;br /&gt;algılıyorlar) diğerleri ile ancak dikkatli fizik deneyleri&lt;br /&gt;sırasında etkileşebiliyoruz. Ne büyük israf demek geliyor insanın&lt;br /&gt;içinden. Ama elinde sonsuz miktarda kaynak olan bir Tanrı için&lt;br /&gt;israf anlamsız bir kavram olmalı :)&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-6346052213157664760?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='related' href='http://tech.groups.yahoo.com/group/ariteknokent/message/974' title='Evrenin israfkarlığı'/><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/6346052213157664760/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=6346052213157664760' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/6346052213157664760'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/6346052213157664760'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2007/06/evrenin-israfkarlg.html' title='Evrenin israfkarlığı'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-8538618039154381592</id><published>2007-06-26T14:25:00.002+03:00</published><updated>2010-11-03T09:08:54.415+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Türkçe'/><title type='text'>Düşünceleri okuyabilmek</title><content type='html'>Prague'daki Association for Computational Linguistics&lt;br /&gt;konferansındayım. Açılış konuşmasını Mitch Marcus yaptı. Kendisi&lt;br /&gt;"Machine Learning" alanındaki çalışmaları ve klasik ders kitabı&lt;br /&gt;ile tanınır. Son zamanlarda uğraştığı işleri anlattı, ilginç&lt;br /&gt;bulduğum için paylaşmak istedim:&lt;br /&gt;&lt;br /&gt;Araştırmada beyninizi tarayan fMRI aletinin içine giriyorsunuz.&lt;br /&gt;Karşınıza her on saniyede bir okumanız için bir kelime çıkıyor.&lt;br /&gt;fMRI'in çektiği beyin aktivite filminizi gördüğünüz kelimelerle&lt;br /&gt;ilişkilendirip daha sonra sadece beyin aktivitenizden düşündüğünüz&lt;br /&gt;kelimeyi bulmaya çalışıyorlar. &lt;span class="fullpost"&gt;&lt;br /&gt;&lt;br /&gt;İlk deneylerde iki grup kelime ile başlamışlar: araçlar&lt;br /&gt;(tornavida, çekiç vs), ve yerler (ev, ofis vs). Modeller&lt;br /&gt;düşündüğünüz kelimenin hangi sınıfa ait olduğunu 100% ayırt&lt;br /&gt;edebiliyor. Tam olarak hangi kelimeyi düşündüğünüz konusunda şu an&lt;br /&gt;performans biraz daha düşük.  İnsandan insana değişmesine rağmen&lt;br /&gt;70% - 95% arasında gibi sınırlı sayıda kelime kümeleri için. Daha&lt;br /&gt;sonra yapılan deneyler kelime okuma yerine aynı objelerin&lt;br /&gt;resimlerine bakma (performans bu durumda daha da iyi), resimlere&lt;br /&gt;bakarken öğrenilen beyin aktiviteleriyle daha sonra okuduğunuz&lt;br /&gt;kelimeleri tahmin etme (bu gözlenen aktivitenin tek bir modalıty'e&lt;br /&gt;(yazı, ses, görüntü gibi) özel olmadığını ve gerçekten o&lt;br /&gt;kavramları gösterdiğine işaret), bir insandan öğrenilen modellerle&lt;br /&gt;diğer insanın ne düşündüğünü tahmin etme (performans biraz daha&lt;br /&gt;düşse de hala çok fena değil), bir dilde kelimeler okuyarak&lt;br /&gt;öğrenilen modellerle başka dilde düşünülen kelimeleri ayırt etme,&lt;br /&gt;ilgili kelimelerden elde edilen bir modelle daha evvel görünmeyen&lt;br /&gt;kelimeleri sınıflandırma vs gibi.&lt;br /&gt;&lt;br /&gt;Tabi üç yaşından beri beyin okuma aletleri hayal eden bendeniz&lt;br /&gt;için çok heyecan verici bir konuşmaydı. Türkiye'ye döner dönmez&lt;br /&gt;Amerikan Hastanesine gidip fMRI aleti ile nasıl oynayabileceğimin&lt;br /&gt;yollarını araştırmak istiyorum.&lt;br /&gt;&lt;br /&gt;İşin ilginci herkes benim kadar heyecanlı değildi. Bazı&lt;br /&gt;arkadaşların bu teknolojinin gideceği yerleri düşünüp çok rahatsız&lt;br /&gt;olduklarını farkettim. fMRİ taşınabilir derecede küçülüp&lt;br /&gt;hassasiyet düşünülen cümleleri ayırt edecek kadar artırıldığında&lt;br /&gt;kafalarına aluminyum folyo sarıp dolaşmaları gerekecek.&lt;br /&gt;&lt;br /&gt;Bense bu mahremiyet konusunda artık teslim olmuş durumdayım. Nasıl&lt;br /&gt;olsa uzun vadede ümit yok, buna ne kadar erken alışırsak o kadar&lt;br /&gt;iyi.  İnsanlar kendi doğalarını kabul edip standartlarını ona göre&lt;br /&gt;ayarlamalı. Internet trafiğinin 50% porno olacağını Tim Berners&lt;br /&gt;Lee tahmin edemezdi herhalde. Ama başımıza taş yağdığı falan yok.&lt;br /&gt;&lt;br /&gt;Bu konuda bildiğim bir David Brin ilginç birşeyler düşünüyor -&lt;br /&gt;açılma güçlüler-güçsüzler, devlet-vatandaşlar vs arasında tek&lt;br /&gt;yönlü olursa problem büyük. O tek yönü engelleyemeyeceğimize göre&lt;br /&gt;en azından çift yönlü olması için bir kültür devrimine başlamamız&lt;br /&gt;gerek. İngiltere'de sokak kameralarını izleyen karakollardaki&lt;br /&gt;memurları vatandaşların da kameralarla izleyebilmesi önerisi&lt;br /&gt;gibi. Neyse bu konu başka bir mesaj eder.&lt;br /&gt;&lt;br /&gt;Bu teknolojinin beni heyecanlandıran getirileri ise saymakla&lt;br /&gt;bitmez.  Birincisi beyinde kavramların ve anlamların ne şekilde&lt;br /&gt;saklandığı konusunda 2500 senedir herkes birşeyler atıp tutuyor,&lt;br /&gt;hala en ufak bir fikrimiz yok. Önümüzdeki senelerde bu konuda&lt;br /&gt;büyük ilerleme sağlanabilir. Kurzweil'in her zaman iddia ettiği&lt;br /&gt;gibi Aİ'i önümüzdeki 50 yılda biz çözemesek de beyin resmetme&lt;br /&gt;aletleri yeterince hassaslaştığında reverse-engineering daha kolay&lt;br /&gt;bir çözüm olabilir.  Daha yakın dönemde bilgisayarlar, internet,&lt;br /&gt;ve hatta diğer insanlarla iletişimimiz için klavye, fare, cep&lt;br /&gt;telefonu gibi gereçlere gerek kalmayabilir - beyne enjekte&lt;br /&gt;edilecek küçük bir wireless internet çipi ile telepatı sonunda&lt;br /&gt;gerçek olabilir. (Tabi call-waiting ve answering-machine benzeri&lt;br /&gt;filtreler bu durumda daha da önem kazanıyor). Politika ve hukuk&lt;br /&gt;konusundaki potansiyeller - yalan makineleri yerine sanığın&lt;br /&gt;düşüncelerini okuyan bir laptop düşünün.  Amerikan başkanı seçimi&lt;br /&gt;kazandığında elini incile koyup standart laflar edeceğine,&lt;br /&gt;seçmenler kafasını fMRİ'in içine sokup gerçekten ne düşündüğünü&lt;br /&gt;öğrenmek isteyebilirler. Daha sayayım mi?&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-8538618039154381592?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='related' href='http://tech.groups.yahoo.com/group/ariteknokent/message/973' title='Düşünceleri okuyabilmek'/><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/8538618039154381592/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=8538618039154381592' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/8538618039154381592'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/8538618039154381592'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2007/06/dusunceleri-okuyabilmek.html' title='Düşünceleri okuyabilmek'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-4474635964851813660</id><published>2007-06-25T22:58:00.003+03:00</published><updated>2010-11-03T08:21:54.864+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Notes'/><title type='text'>Targeted Textual Entailments - A Proposal</title><content type='html'>1. Definition:  A targeted textual entailment (TTE) task uses entailment questions to test a specific competence of a system, such as word sense disambiguation, semantic relation recognition, or parsing.  Even if we do not know the best theory underlying a competence, we know what having that competence enables people to do. For example: &lt;span class="fullpost"&gt;&lt;br /&gt;&lt;br /&gt;1.1 WSD&lt;br /&gt;"They had a board meeting today."&lt;br /&gt;==&gt; "They had a committee meeting today."  [yes]&lt;br /&gt;==&gt; "They had a plank meeting today." [no]&lt;br /&gt;&lt;br /&gt;1.2 SemRel&lt;br /&gt;"John opened the car door."&lt;br /&gt;==&gt; "The door is part of the car." [yes]&lt;br /&gt;==&gt; "The car produced the door." [no]&lt;br /&gt;&lt;br /&gt;1.3 Parsing&lt;br /&gt;"I saw the bird using a telescope."&lt;br /&gt;==&gt; "I used a telescope" [yes]&lt;br /&gt;==&gt; "The bird used a telescope" [no]&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;2. Motivation: Targeted textual entailment tasks address the following issues:&lt;br /&gt;&lt;br /&gt;2.1 Currently most shared tasks use or favor a specific inventory, representation, or linguistic theory.  In WSD, WordNet is often used as the sense inventory although everybody complains about it.  We have FrameNet, Propbank, Nombank, various logical formalisms and different sets of noun-noun relations people work on in the semantic relations area.  The parsing community is split into a constituency group and a dependency group that rarely compare results.  Formulating TTE tasks in these fields will help test systems on a level playing field no matter which inventory, presentation, or linguistic theory they use.&lt;br /&gt;&lt;br /&gt;2.2 Large annotation efforts struggle to achieve high inter annotator agreement (ITA).  My hypothesis is most annotators understand the sentences they are supposed to annotate equally well, but do not understand the formalism good enough for consistent labeling.  By asking simple entailment questions where all they need to do is choose yes/no/uncertain, it is hoped that (i) no education in a particular formalism will be needed for annotators, (ii) annotation will proceed faster, and (iii) final ITA will be higher.  (We throw away the examples that get a lot of "uncertain" answers).&lt;br /&gt;&lt;br /&gt;3. Methodology: For a TTE task to be useful and challenging the examples should be chosen close to the border that separates the positives from the negatives.  In other words, the positive examples should have non-trivial alternatives and the negative examples should be "near-misses".  (My examples in Section 1 are not all very good according to this criteria).  For example in the WSD TTE task (which is basically lexical substitution), the substitute should be chosen such that (i) it is a near-synonym for one of the target's senses, and/or (ii) it has a high probability of occuring in the given context.  In the parsing task, examples should be based on decision points where a typical parser can go either way or where the n-best parses disagree.  This suggests that examples can be generated automatically by taking the best automated systems of the day and focusing on decisions about which they are least confident.  This type of "active learning" methodology will uncover weaknesses that the next generation systems can focus on.&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-4474635964851813660?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='related' href='http://www.aclweb.org/aclwiki/index.php?title=Recognizing_Textual_Entailment' title='Targeted Textual Entailments - A Proposal'/><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/4474635964851813660/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=4474635964851813660' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/4474635964851813660'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/4474635964851813660'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2007/06/targeted-textual-entailments-proposal.html' title='Targeted Textual Entailments - A Proposal'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-1418234644472025091</id><published>2007-06-23T16:24:00.006+03:00</published><updated>2010-11-03T08:27:27.311+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Publications'/><title type='text'>KU: Word Sense Disambiguation by Substitution</title><content type='html'>Deniz Yuret.  In &lt;i&gt;Proceedings of the Fourth International Workshop on Semantic Evaluations (SemEval-2007)&lt;/i&gt;&lt;span class="fullpost"&gt;&lt;br /&gt;&lt;br /&gt;&lt;iframe src='http://docs.google.com/EmbedSlideshow?docid=d2jm3f3_318c453bscv' frameborder='0' width='410' height='342'&gt;&lt;/iframe&gt;&lt;br /&gt;&lt;br /&gt;Abstract: Data sparsity is one of the main factors that make word sense disambiguation (WSD) difficult. To overcome this problem we need to find effective ways to use resources other than sense labeled data. In this paper I describe a WSD system that uses a statistical language model based on a large unannotated corpus. The model is used to evaluate the likelihood of various substitutes for a word in a given context. These likelihoods are then used to determine the best sense for the word in novel contexts. The resulting system participated in three tasks in the SemEval 2007 workshop. The WSD of prepositions task proved to be challenging for the system, possibly illustrating some of its limitations: e.g. not all words have good substitutes. The system achieved promising results for the English lexical sample and English lexical substitution tasks.&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-1418234644472025091?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='related' href='http://www.aclweb.org/anthology-new/W/W07/W07-2044.pdf' title='KU: Word Sense Disambiguation by Substitution'/><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/1418234644472025091/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=1418234644472025091' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/1418234644472025091'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/1418234644472025091'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2007/06/ku-word-sense-disambiguation-by.html' title='KU: Word Sense Disambiguation by Substitution'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-5359662721688056932</id><published>2007-06-23T16:20:00.007+03:00</published><updated>2010-11-03T08:27:27.311+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Publications'/><title type='text'>SemEval-2007 Task 04: Classification of Semantic Relations between Nominals</title><content type='html'>Roxana Girju, Preslav Nakov, Vivi Nastase, Stan Szpakowicz, Peter Turney and Deniz Yuret.  In &lt;i&gt;Proceedings of the Fourth International Workshop on Semantic Evaluations (SemEval-2007)&lt;/i&gt; &lt;span class="fullpost"&gt;&lt;br /&gt;&lt;br /&gt;&lt;iframe src='http://docs.google.com/EmbedSlideshow?docid=d2jm3f3_286cvd699g3' frameborder='0' width='410' height='342'&gt;&lt;/iframe&gt;&lt;br /&gt;&lt;br /&gt;Abstract: The NLP community has shown a renewed interest in deeper semantic analyses, among them automatic recognition of relations between pairs of words in a text. We present an evaluation task designed to provide a framework for comparing different approaches to classifying semantic relations between nominals in a sentence. This is part of SemEval, the 4th edition of the semantic evaluation event previously known as SensEval. We define the task, describe the training/test data and their creation, list the participating systems and discuss their results. There were 14 teams who submitted 15 systems.&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-5359662721688056932?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='related' href='http://www.aclweb.org/anthology-new/W/W07/W07-2003.pdf' title='SemEval-2007 Task 04: Classification of Semantic Relations between Nominals'/><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/5359662721688056932/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=5359662721688056932' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/5359662721688056932'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/5359662721688056932'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2007/06/semeval-2007-task-04-classification-of.html' title='SemEval-2007 Task 04: Classification of Semantic Relations between Nominals'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-7404189085901071097</id><published>2007-06-03T14:19:00.002+03:00</published><updated>2010-11-03T09:08:54.415+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Türkçe'/><title type='text'>İrade üzerine</title><content type='html'>Uzun zamandır bilinç ilintili konularda kafamı en çok kurcalayan&lt;br /&gt;problem "irade" (ya da "istek", "motivasyon", "dürtü") problemi.&lt;br /&gt;Sabah uyandığımızda neden hiç bir şey yapmamaktansa birşeyler&lt;br /&gt;yapmayı yeğliyoruz? Davranışlarımızı üreten temel kaynak nedir?&lt;br /&gt;Herhangi birşey yaptığımda "neden yaptım" diye düşünürsem, bir iki&lt;br /&gt;rasyonalizasyondan sonra son cevabım "canım öyle istedi de ondan"&lt;br /&gt;oluyor. Bu cevap (ya da genel olarak "canın birşey istemesi"&lt;br /&gt;kavramı) sizde de derin bir beyin sansürü şüphesi yaratmıyor mu?&lt;br /&gt;Eğer ben insanları kontrol etmek isteyen kötü bir uzaylı olsam,&lt;br /&gt;onlara zorla birşey yaptırmaya çalışmak yerine, beyinlerinin "can&lt;br /&gt;istemesi" sistemleri ile oynardım - böylece benim amaçlarımı kendi&lt;br /&gt;"canları istediği" için yerine getirirler ve canlarının neyi neden&lt;br /&gt;istediği konusunda bir fikre sahip olmadıkları için de varlığımdan&lt;br /&gt;şüphelenmezlerdi. &lt;span class="fullpost"&gt;&lt;br /&gt;&lt;br /&gt;Bu problemin önemi arada bir çalışma motivasyonumu yitirdiğimde&lt;br /&gt;kafama dank ediyor. İnsanın "istediği" birşeyi yapmasıyla&lt;br /&gt;"istemediği" birşeyi kendine zorla yaptırması arasında büyük fark&lt;br /&gt;var - ortaya çıkan "davranış" Skinner'in bakışıyla ayırdedilemez&lt;br /&gt;olsa da.&lt;br /&gt;&lt;br /&gt;Geçen sene ünlü biyolog J.B.S. Haldane'nin aşağıdaki sözünü&lt;br /&gt;bulunca bu konudaki merakımda yanlız olmadığımı farkettim:&lt;br /&gt;&lt;br /&gt;"Kendi motivasyonuma dair şahsi açıklamalarımın hemen her durumda&lt;br /&gt;tamamen uydurma olduğu sonucuna varmış bulunuyorum. Neyi neden&lt;br /&gt;yaptığımı bilmiyorum." -- J.B.S. Haldane&lt;br /&gt;&lt;br /&gt;İşin üzücü tarafı benim geldiğim yapay zeka kültüründe böyle bir&lt;br /&gt;problemin önemi vurgulanmıyor. Klasik bir yapay zeka sisteminde&lt;br /&gt;algılarla dünyanın durumu belirlenir, amaçlar doğrultusunda plan&lt;br /&gt;yapılır ve bu planlar davranışa dönüştürülür. Peki amaçlar nereden&lt;br /&gt;gelir? Programı yazandan tabi ki. Dolayısıyla üzerinde en az&lt;br /&gt;düşünülen kısım olagelmiş motivasyon. Geçenlerde kurzweilai.net'de&lt;br /&gt;modern YZ'cılardan Ben Goertzel'in kendi cognitive modeli üzerine&lt;br /&gt;yazdıklarını okurken aşağıdaki paragrafı gördüm (kötü olduğu için&lt;br /&gt;çevirmeye uğraşmayacağım):&lt;br /&gt;&lt;br /&gt;"And if a system can recognize itself, it can recognize&lt;br /&gt;probabilistic relationships between itself and various effects in&lt;br /&gt;the world. It can recognize patterns of the form "If I do X, then&lt;br /&gt;Y is likely to occur." This leads to the pattern known as will."&lt;br /&gt;-- Ben Goertzel&lt;br /&gt;&lt;br /&gt;Kısacası davranışlar ve olası sonuçları hakkındaki bilgiler ile bu&lt;br /&gt;davranışların hangilerinin ne zaman ne sebeple aktive edildiği&lt;br /&gt;konuları birbirine karıştırılmış.&lt;br /&gt;&lt;br /&gt;Bu genel bir körlük bilgisayarcılar arasında belki. Düşünürseniz&lt;br /&gt;dünyanın en sofistike programları vakitlerinin çoğunu sizin bir&lt;br /&gt;düğmeye basmanızı bekleyerek harcıyorlar! Kendi kendine günler&lt;br /&gt;boyunca ilginç birşeyler yapabilen bir program yazamadık bugüne&lt;br /&gt;kadar.  Örneğin Doug Lenat'ın doktora tezinde yazdığı ilginç&lt;br /&gt;matematiksel varsayımları otomatik olarak keşfetmeye çalışan AM&lt;br /&gt;programı toplama çıkarmadan başlayıp 1700'lerin matematiğine&lt;br /&gt;birkaç saat içinde geliyordu. Yıllar boyunca matematik düşünebilen&lt;br /&gt;ve ilginç şeyler bulmaya devam eden bir program yok. Uzun süre&lt;br /&gt;çalışan programlarımız proteinlerin fiziksel simülasyonunu yapmak,&lt;br /&gt;interneti indekslemek, uzaydan gelen mesajlarda düzen aramak gibi&lt;br /&gt;monoton, kendini devamlı tekrarlayan işler yapıyorlar. Belki&lt;br /&gt;istediklerimizi eksiksiz ve hatasız şekilde yerine getiren&lt;br /&gt;makineler tasarlama saplantısından kurtulmadığımız sürece,&lt;br /&gt;makinelerimizin bizim hayal edebildiklerimiz ötesinde birşey&lt;br /&gt;yapamaması son derece doğal.&lt;br /&gt;&lt;br /&gt;Motivasyon konusunda her ne kadar basit de olsa açıklayıcı&lt;br /&gt;modeller üreten tek grup "ethology" (hayvan davranışı)&lt;br /&gt;bilimi. Tinbergen, Lorenz gibi öncüler en azından basit&lt;br /&gt;hayvanların neyi neden yaptığıyla ilgili mekanik modeller&lt;br /&gt;geliştirmeye başlamışlar. Gallistel'in yazdığı "The Organization&lt;br /&gt;of Action" kitabı uzun zamandır rafimda, bu aralar okuyup ilginç&lt;br /&gt;birşeyler bulursam yazacağım.&lt;br /&gt;&lt;br /&gt;İnsanlarda ise bu probleme yaklaşmak ve düşünmek özellikle zor,&lt;br /&gt;çünkü "canım istedi" hissi öyle tatmin edici ki bunun arkasında&lt;br /&gt;birşeyler olduğunu düşünmeye karşı sanki beyinde bir oto-sansür&lt;br /&gt;mekanizması var.&lt;br /&gt;&lt;br /&gt;Not: Burada aradığım açıklamanın doğasıyla ilgili bir not düşeyim.&lt;br /&gt;Onur son mesajında bazı bilimsel kavramlarla dini kavramların&lt;br /&gt;temelde farkı olup olmadığına değinmiş. Çok basit bir fark&lt;br /&gt;var. Birini kullanarak gelecekte olacaklara dair (örneğin bir&lt;br /&gt;deneyde) tutarlı tahminlerde bulunabiliyoruz, diğeriyle&lt;br /&gt;bulunamıyoruz (bkz. bilim nedir tartışmamız). Benim&lt;br /&gt;"açıklamak"'tan kastettiğim bu. Örneğin benim motivasyonumu bir&lt;br /&gt;"ruh" modeliyle açıklamaya kalkarsak, sadece soruyu ertelemiş&lt;br /&gt;oluruz - "ruh"'un motivasyonunu nasıl açıklayacağız?  Üstelik&lt;br /&gt;fiziksel "ben"'in motivasyonunu birgün açıklayabilme ümidimiz var&lt;br /&gt;- atomlarımı açıklayan kuantumun kuralları kolay anlaşılmaz da&lt;br /&gt;olsa sonuçta bağlayıcı, üstelik virgülden sonra dokuz basamağa&lt;br /&gt;kadar.  "Ruh"'un davranışını belirleyen bağlayıcı kurallar&lt;br /&gt;bildiğim kadarıyla yok. Meteryalizm insanların körü körüne&lt;br /&gt;inandığı birşey değil, sadece bu tip sorulara, aynı soruyu bir&lt;br /&gt;adım öteye ertelemeden cevap vermeye çalışan tek alternatif. Neyin&lt;br /&gt;"açıklama" olup neyin olmadığı konusunda iyi anlaşalım.&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-7404189085901071097?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='related' href='http://tech.groups.yahoo.com/group/ariteknokent/message/965' title='İrade üzerine'/><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/7404189085901071097/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=7404189085901071097' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/7404189085901071097'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/7404189085901071097'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2007/06/irade-uzerine.html' title='İrade üzerine'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-1857861956381130294</id><published>2007-05-22T14:10:00.001+03:00</published><updated>2010-11-03T09:08:54.416+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Türkçe'/><title type='text'>A New Kind of Science - Stephen Wolfram</title><content type='html'>Geçenlerde Stephen Wolfram'ın (Mathematica'nın yaratıcısı) $25000&lt;br /&gt;ödüllü bir soru koyduğunu duydum web'e (http://www.wolframprize.org).&lt;br /&gt;Bu da beni bir süre önce aldığım ama 1000 sayfa olduğu için&lt;br /&gt;korkudan hiç açmadığım "A New Kind of Science" kitabına bakmaya&lt;br /&gt;ikna etti.  &lt;span class="fullpost"&gt;&lt;br /&gt;&lt;br /&gt;Ana konu her işi yapabilen (hesap anlamında), düşünebileceğiniz&lt;br /&gt;her makineyi simüle edebilen evrensel makinelerin var&lt;br /&gt;oluşu. Fiziksel cisimlerle uğraşmaya alışık dimağlarımız için bu&lt;br /&gt;inanması zor bir durum. Bir makine düşünün, istediğiniz zaman bir&lt;br /&gt;arabayı, istediğiniz zaman bir tornavidayı, istediğiniz zaman bir&lt;br /&gt;telefonu simüle edebiliyor! Bir kere A makinesinin B makinesini&lt;br /&gt;simüle edebilmesi için A'nın daha kompleks bir makine olması gerek&lt;br /&gt;gibi bir sezgimiz var. Eğer bu sezgi doğru olsaydı, evrensel bir&lt;br /&gt;makine mümkün olamazdı, her zaman ondan daha kompleks makineler&lt;br /&gt;tasarlayabilirdik.  (Asal sayıların sonsuz olmasının ispatı ile&lt;br /&gt;olan analojiye dikkatinizi çekerim).&lt;br /&gt;&lt;br /&gt;Ama Alan Turing 1936'da sanal alemde böyle bir makinenin mümkün&lt;br /&gt;olabileceğini gösterdi. İşin ilginç tarafı Turing'in evrensel&lt;br /&gt;makinesi oldukça basit. Sonsuz bir şerit üzerinde bir daktilonun&lt;br /&gt;kafası gibi ileri geri hareket edebilen yazıcı bir kafa düşünün&lt;br /&gt;(1936'da henüz PC'ler icat olunmamıştı, ama daktilolar vardı). Bu&lt;br /&gt;kafanın yapabildiği tek şey şeritte üzerinde bulunduğu sembolü&lt;br /&gt;okuyabilmek, ve sonlu sayıda kurala göre yeni bir sembol yazıp&lt;br /&gt;sağa ya da sola doğru bir pozisyon hareket edebilmek. Turing bu&lt;br /&gt;makinenin diğer tüm makineleri simüle edebileceğini ispatladı.&lt;br /&gt;&lt;br /&gt;Evrensel olan tek makine tabi bu daktilo bozması şey değil, Turing&lt;br /&gt;makineleri sadece evrenselliği ilk ispatlanan (gerçi Godel'in&lt;br /&gt;1931'deki meşhur ispatında da evrensel bir makine gizli olduğu&lt;br /&gt;söylenebilir). "A new kind of science" kitabının üçüncü chapterini&lt;br /&gt;okursanız iyi bir basit makineler listesi ve güzel tarifleri var&lt;br /&gt;(http://www.wolframscience.com/nksonline):&lt;br /&gt;&lt;br /&gt;- cellular automata&lt;br /&gt;- mobile automata&lt;br /&gt;- turing machines&lt;br /&gt;- substitution systems&lt;br /&gt;- tag systems&lt;br /&gt;- cyclic tag systems&lt;br /&gt;- register machines&lt;br /&gt;- symbolic systems&lt;br /&gt;&lt;br /&gt;Ve tabi bugün kullandığımız tüm bilgisayar dilleri ve CPU'larını&lt;br /&gt;da (sonsuz bir memory ekledikten sonra) bu evrensel makineler&lt;br /&gt;listesine koyabiliriz.&lt;br /&gt;&lt;br /&gt;Tüm bu makine çeşitleri içinde belli bir karmaşıklığa ulaştıktan&lt;br /&gt;sonra evrenselliği görmek mümkün - bu evrensel makinelerden&lt;br /&gt;herhangi birisi diğer hepsini simüle edebiliyor. Fakat her Turing&lt;br /&gt;makinesi evrensel değil. Örneğin sadece iki çeşit sembol okuyup&lt;br /&gt;yazabilen, ve sadece iki çeşit kurala göre hareket edebilen Turing&lt;br /&gt;makinelerinin evrensel olamayacağı gösterilmiş. Turing'in orijinal&lt;br /&gt;makinesinde tam kaç sembol ve kaç kural olduğunu bilmiyorum. Ama&lt;br /&gt;1960'larda Marvin Minsky 4 sembol ve 7 kural ile evrensel bir&lt;br /&gt;Turing makinesi yapılabileceğini göstermiş. Uzun zaman basit&lt;br /&gt;Turing makineleri konusu ile kimse uğraşmazken, 1990'larda Wolfram&lt;br /&gt;5 sembol ve 2 kural kullanarak evrensel bir makine&lt;br /&gt;yapılabileceğini göstermiş. Yani bu makinenin kendini içinde&lt;br /&gt;bulabileceği 10 farklı durum var. Bu 10 farklı durum için&lt;br /&gt;bulunduğu pozisyona ne yazacağına ve sağa mı sola mı hareket&lt;br /&gt;edeceğine karar veriyorsunuz, o kadar. Ve bu makine dünyada&lt;br /&gt;yazılmış tüm programları simüle etmek için yeterli! (Sadece&lt;br /&gt;programınızın uygun bir tercümesini makinenin şeridine başta&lt;br /&gt;yazmanız gerek, tabi simülasyonun biraz yavaş olduğunu da akıldan&lt;br /&gt;çıkarmamak lazım).&lt;br /&gt;&lt;br /&gt;Ödüllü soru ise Wolfram'ın seçtiği 2 kural ve 3 renkli bir Turing&lt;br /&gt;makinesinin evrenselliğini ispatlamak. Var mı gönüllü?&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-1857861956381130294?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='related' href='http://tech.groups.yahoo.com/group/ariteknokent/message/961' title='A New Kind of Science - Stephen Wolfram'/><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/1857861956381130294/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=1857861956381130294' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/1857861956381130294'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/1857861956381130294'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2007/05/new-kind-of-science-stephen-wolfram.html' title='A New Kind of Science - Stephen Wolfram'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-1201204850016820384</id><published>2007-05-22T13:53:00.001+03:00</published><updated>2010-11-03T09:08:54.417+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Türkçe'/><title type='text'>Bilinç üzerine bir okuma listesi</title><content type='html'>Uzun zaman sonra "bilinç" konusunu düşünmeye ve okumaya başladım.  Aşağıda küçük bir okuma listesi gönderiyorum ilgilenen için.  Ama öncelikle bu konuda "temel soru nedir" sorusuna kendi fikrimce bir cevap yazayım dedim. Belki tatmin edici olmayacak ama konuya bir yerinden girelim. &lt;span class="fullpost"&gt;&lt;br /&gt;&lt;br /&gt;Galileo ve Newton "evrensel kanunları" keşfetmeden ve Laplace&lt;br /&gt;cinini (&lt;a href="http://en.wikipedia.org/wiki/Laplace's_demon"&gt;Laplace's demon&lt;/a&gt;) icat etmeden önce "bilinç" bugün&lt;br /&gt;oluşturduğu kadar ciddi bir sorun değildi sanki.  Hayatı özgür&lt;br /&gt;irademiz, isteklerimiz ve inançlarımız cinsinden açıklayabiliyor,&lt;br /&gt;ruhun gerçekliğine maddeninkine olduğundan daha güçlü bir şekilde&lt;br /&gt;bağlanabiliyorduk. Neden olmasın ki, Descartes'in de dediği gibi&lt;br /&gt;en emin olduğumuz şeyler sonuçta kendi düşüncelerimiz,&lt;br /&gt;algılarımız, isteklerimiz - başka herşey yalan olabilir sonuçta...&lt;br /&gt;Bilim konusuna fazla ilgisi olmayan arkadaşlara bilinç konusunu&lt;br /&gt;açmaya çalıştığımda genelde benzer bir tepkiyle karşılaşıyorum:&lt;br /&gt;"problem nedir ki" diyorlar sanki, "ben neyin (irademin,&lt;br /&gt;algılarımın, inançlarımın) gerçek olduğunu biliyorum, senin&lt;br /&gt;bilimin bunları henüz açıklayamıyorsa bu bilimin sorunu..."&lt;br /&gt;&lt;br /&gt;Tabi bilimsel olarak kimse daha bilinç ve benzer psikolojik&lt;br /&gt;kavramları beynin fonksiyonları olarak açıklayabilmiş değil. Bazı&lt;br /&gt;filozoflar temelde böyle bir açıklamanın mümkün olabileceğine&lt;br /&gt;karşılar. Fakat yine de sonuçta herşeyin fiziğin temel kanunları&lt;br /&gt;cinsinden açıklanabileceğine dair yaygın bir inanç var. Fiziğin&lt;br /&gt;temel kanunları ise dışarıdan bir etki kabul etmiyor (enerjinin&lt;br /&gt;korunumu vb). Bu durumda bizim birşeyleri kontrol ediyor olmamız&lt;br /&gt;(ki en güçlü inançlarımızdan biri), imkansız gibi&lt;br /&gt;görünüyor. Sonuçta hepimiz atomların köleleri durumundayız&lt;br /&gt;gibi. Ama ben hiç de kendimi atomların oraya buraya ittirdiği bir&lt;br /&gt;madde bulutu gibi hissetmiyorum.&lt;br /&gt;&lt;br /&gt;Bu durumdan çıkmak için iki yol var. Birincisi irade, algı, istek&lt;br /&gt;ve inançların temel oluşundan taviz vermemek. Bir ihtimal fiziği&lt;br /&gt;modifiye etmemiz gerekebilir (Penrose metodu). Ya da fiziğin&lt;br /&gt;boşlukları içinde (kuantum belirsizlik, kaos, vs) iradeye yer&lt;br /&gt;aramak.  Ya da "bilinç" diye madde ve enerji dışında yeni bir&lt;br /&gt;temel olgu olduğunu varsayıp baştan başlamak.&lt;br /&gt;&lt;br /&gt;İkinci yol ise fizikten taviz vermemek. Bu pozisyondaki insanların&lt;br /&gt;yapması gereken ise "nasıl oluyor da bize böyle gözüküyor"&lt;br /&gt;sorusuna cevap vermek. Mesajın sonundaki listede verdiğim tüm&lt;br /&gt;kitaplar bu ikinci yolu takip etmeye çalışan insanların&lt;br /&gt;yazdıkları. Ama o zaman gerçekten atomların oraya buraya ittirdiği&lt;br /&gt;bir bulutsam (ya da Sagan'ın deyimiyle yıldız tozu isek) --&lt;br /&gt;&lt;br /&gt;1. Nasıl oluyor da ben istediğim zaman kolumu oynatabiliyorum?&lt;br /&gt;&lt;br /&gt;2. Kırmızı bana belli bir şekilde gözüküyor, beynimin her detayını&lt;br /&gt;bilseniz bile kırmızının bana nasıl gözüktüğünü anlayabilir&lt;br /&gt;misiniz?&lt;br /&gt;&lt;br /&gt;3. İnsanların (ve bazı diğer canlıların) davranışlarını "istek",&lt;br /&gt;"inanç", "irade" vb kavramları ile hayli iyi açıklayabilmek ve&lt;br /&gt;tahmin edebilmek nasıl mümkün. "istek" gibi kavramların fiziksel&lt;br /&gt;karşılığı nedir?&lt;br /&gt;&lt;br /&gt;4. Aynı benim gibi tepki veren bir robot yapsak onun da içinde&lt;br /&gt;dışarıya bakıp gerçekten birşeyler gören "birisi" olur mu? Yoksa&lt;br /&gt;içi boş olabilir mi? Bu robota sen bilinçli misin desek, o da&lt;br /&gt;benim gibi evet dese, yalan mı söylüyor olur?&lt;br /&gt;&lt;br /&gt;5. Eğer hepimiz yıldız tozu isek bir grup tozun diğer grup tozu&lt;br /&gt;yok etmesi, acı çektirmesi (zaten acı ne demek?) gibi konularda&lt;br /&gt;niye bu kadar hassasız? Ki böyle kötü şeyler olduğunda, oraya&lt;br /&gt;buraya ittiren atomlar sorumlu ise ceza sistemi saçma değil mi?&lt;br /&gt;&lt;br /&gt;6. Ben niye kendimi isteyen, yapan, belli şeylere inanan bir&lt;br /&gt;varlık gibi hissediyorum?&lt;br /&gt;&lt;br /&gt;Bu listeyi uzatmak mümkün. Ama dikkat ederseniz hemen her soru&lt;br /&gt;Laplace'in mekanik evreni ile psikolojik ve ahlaki kavramlarımızın&lt;br /&gt;çatışmasından türüyor. En son (ve bence en önemli) soru için&lt;br /&gt;Wittgenstein'ın bilinen bir anektodu gayet aydınlatıcı bence:&lt;br /&gt;&lt;br /&gt;Wittgenstein bir arkadaşına sormuş: "Hep merak etmişimdir,&lt;br /&gt;insanlar niye bu kadar uzun zaman güneşin dünyanın etrafında&lt;br /&gt;döndüğünü sanmışlar?" Arkadaşı cevap vermiş: "Neden olacak,&lt;br /&gt;buradan bakınca öyle gözüküyor da ondan..." Wittgenstein'ın&lt;br /&gt;cevabı: "Peki dünya kendi etrafında dönüyor olsaydı nasıl&lt;br /&gt;gözükecekti?"&lt;br /&gt;&lt;br /&gt;Peki hepimiz gerçekten trilyonlarca molekülün itişmesinden oluşan&lt;br /&gt;karmaşık yaratıklar olsaydık, kendimizi nasıl hissedecektik?&lt;br /&gt;&lt;br /&gt;======&lt;br /&gt;Okuma listesi:&lt;br /&gt;&lt;br /&gt;[1] Brainstorms, Daniel C. Dennett 1978&lt;br /&gt;- Dennett malesef Russell kadar temiz düşünüp ifade edebilen bir&lt;br /&gt;yazar değil, ama sanki fuzzy konularda doğru yönlere ışık tutuyor&lt;br /&gt;gibi.&lt;br /&gt;&lt;br /&gt;[2] Godel, Escher, Bach, Douglas Hofstadter 1979&lt;br /&gt;- AI'cıların pek övdüğü klasik eser. Okuduğum en güzel tasarlanmış&lt;br /&gt;(estetik olarak) kitaplardan biri (diyaloglar, anagramlar,&lt;br /&gt;resimler).  Tercümesi çok zor olmasına rağmen birileri uğraşıp&lt;br /&gt;Hofstadter'in yardımıyla Türkçe'ye çevirmiş, fakat hiç satmadığını&lt;br /&gt;duydum. Her fırsatta bir iki tane alıp öğrencilerime hediye&lt;br /&gt;ediyorum :)&lt;br /&gt;&lt;br /&gt;[3] Mind's I, Hofstadter and Dennett 1981&lt;br /&gt;- Okuması kolay güzel hikayelerden (philosophical fiction?)&lt;br /&gt;oluşuyor - bir bilinç mesnevi'si denebilir - bence grubun mecburi&lt;br /&gt;okuma listesinde olmalı ki kompleks fikirleri anlatmak için bi&lt;br /&gt;sürü uğraşacağımıza bu hikayelere gönderme yapabilelim.&lt;br /&gt;&lt;br /&gt;[4] Social Brain, Gazzaniga 1987&lt;br /&gt;- Bilincin aslında göründüğü gibi olmadığını (?) en güzel gösteren&lt;br /&gt;neuroscience öykülerinden biri. En büyük problem bilinci herkes&lt;br /&gt;anladığını hissediyor, gerçekten anlama yolunda ilk yapılması&lt;br /&gt;gereken bu "sanılan" anlayışın böyle güzel deneylerle yavaş yavaş&lt;br /&gt;yıkılması.  Bu arada Aslı'nın gönderdiği görsel illüzyon linki de&lt;br /&gt;iyi bir başlangıç.&lt;br /&gt;&lt;br /&gt;[5] Consciousness Explained, Daniel C. Dennett 1991&lt;br /&gt;- Dennett sonunda orada burada ağzında gevelediği tüm düşünceleri&lt;br /&gt;bir araya koymuş. Ama tabi "bilinç nedir?"-&gt;"bilinç ilüzyondur",&lt;br /&gt;"qualia nedir?"-&gt;"qualia yoktur" gibi cevaplardan oluştuğu için&lt;br /&gt;rakipleri kitaba "Consciousness explained away" ismi vermişler.&lt;br /&gt;&lt;br /&gt;[6] Consciousness: A Very Short Introduction, Susan Blackmore 2005&lt;br /&gt;- Diğerlerini çok güzel özetleyen kısa ve güzel bir&lt;br /&gt;başlangıç. Sadece bir kitaba bakabilecekseniz bunu okuyun derim.&lt;br /&gt;&lt;br /&gt;[7] Good and Real, Gary L. Drescher 2006&lt;br /&gt;- Mekanik evren modeli içerisinde sadece bilinci, zamanı ve kaderi&lt;br /&gt;değil, aynı zamanda ahlaki da açıklayabileceğimizi savunan iddialı&lt;br /&gt;bir eser.&lt;br /&gt;&lt;br /&gt;[8] I am a Strange Loop, Douglas Hofstadter 2007&lt;br /&gt;- Bunu yeni okuyorum daha, ama Hofstadter'in kendi deyimiyle [2]&lt;br /&gt;GEB'de anlatmak istediklerini tam anlamayanlar için yazılmış bir&lt;br /&gt;kitap.&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-1201204850016820384?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='related' href='http://tech.groups.yahoo.com/group/ariteknokent/message/959' title='Bilinç üzerine bir okuma listesi'/><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/1201204850016820384/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=1201204850016820384' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/1201204850016820384'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/1201204850016820384'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2007/05/bilinc-uzerine-bir-okuma-listesi.html' title='Bilinç üzerine bir okuma listesi'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-4485177508409412485</id><published>2007-04-11T12:01:00.004+03:00</published><updated>2009-11-17T16:31:25.869+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Publications'/><title type='text'>Locally Scaled Density Based Clustering</title><content type='html'>Ergun Biçici and Deniz Yuret. In &lt;i&gt;International Conference on Adaptive and Natural Computing Algorithms (ICANNGA 2007), LNCS 4431, Part I, Springer-Verlag. &lt;/i&gt; (&lt;a href="http://deniz.yuret.com/2007/04/locally-scaled-density-based-clustering/LSDBC-icannga07.pdf"&gt;PDF&lt;/a&gt;, &lt;a href="http://deniz.yuret.com/2007/04/locally-scaled-density-based-clustering/LSDBC-icannga07.ps"&gt;PS&lt;/a&gt;, &lt;a href="http://deniz.yuret.com/2007/04/locally-scaled-density-based-clustering/ICANNGATalk.pdf"&gt;Presentation&lt;/a&gt;, &lt;a href="http://deniz.yuret.com/2007/04/locally-scaled-density-based-clustering/lsdbccode.zip"&gt;Code&lt;/a&gt;, &lt;a href="http://deniz.yuret.com/2007/04/locally-scaled-density-based-clustering/CLSDBC_README.txt"&gt;Readme&lt;/a&gt;)&lt;span class="fullpost"&gt;&lt;br /&gt;&lt;br /&gt;Abstract: Density based clustering methods allow the identification of arbitrary, not necessarily convex regions of data points that are densely populated. The number of clusters does not need to be specified beforehand; a cluster is defined to be a connected region that exceeds a given density threshold. This paper introduces the notion of local scaling in density based clustering, which determines the density threshold based on the local statistics of the data. The local maxima of density are discovered using a k-nearest-neighbor density estimation and used as centers of potential clusters. Each cluster is grown until the density falls below a pre-specified ratio of the center point’s density. The resulting clustering technique is able to identify clusters of arbitrary shape on noisy backgrounds that contain significant density gradients. The focus of this paper is to automate the process of clustering by making use of the local density information for arbitrarily sized, shaped, located, and numbered clusters. The performance of the new algorithm is promising as it is demonstrated on a number of synthetic datasets and images for a wide range of its parameters.&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-4485177508409412485?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='related' href='http://deniz.yuret.com/2007/04/locally-scaled-density-based-clustering/LSDBC-icannga07.pdf' title='Locally Scaled Density Based Clustering'/><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/4485177508409412485/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=4485177508409412485' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/4485177508409412485'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/4485177508409412485'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2007/04/locally-scaled-density-based-clustering.html' title='Locally Scaled Density Based Clustering'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-265376350351978391</id><published>2007-03-19T22:31:00.000+02:00</published><updated>2010-11-03T09:08:54.418+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Notes'/><title type='text'>Drexler quote</title><content type='html'>&lt;p&gt;It makes sense to think in terms of three levels of knowledge about a field:&lt;/p&gt;             &lt;ol&gt;&lt;li&gt; Knowing what a field is about—knowing what sorts of physical systems and phenomena it deals with, and what sorts of questions it asks and answers.&lt;/li&gt;&lt;li&gt;                 Knowing the &lt;a href="javascript:loadBrain('Content')" onmouseover="playBrain('Content')" onmouseout="stopBrain()" class="thought"&gt;content&lt;/a&gt; of a field in a qualitative sense—having a good feel for what sorts of phenomena can be important in what circumstances, and knowing when you need answers from work in that field.&lt;/li&gt;&lt;li&gt;Knowing how to get those answers yourself, based on personal mastery of enough of the field's subject matter.&lt;/li&gt;&lt;/ol&gt;             &lt;p&gt;If one has enough knowledge at levels (1) and (2) in enough fields, then one can steer clear of problems in those fields while doing work in a related field where you have knowledge at level (3). And this is a good thing, because knowledge at levels (1) and (2) takes far less time to acquire. But to make proper use of knowledge at levels (1) and (2) requires a harsh discipline: attempt to assume the worst about what you don't know. Don't assume that a poorly-understood physical effect will somehow save your design; do assume (until finding otherwise) that it may utterly ruin it. Without this discipline, you'll become an intellectual hazard. With it, you'll be able to make a real contribution.&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-265376350351978391?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='related' href='http://www.kurzweilai.net/meme/frame.html?main=/articles/art0699.html' title='Drexler quote'/><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/265376350351978391/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=265376350351978391' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/265376350351978391'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/265376350351978391'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2007/03/drexler-quote.html' title='Drexler quote'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-4271537860009173659</id><published>2007-03-11T13:42:00.001+02:00</published><updated>2010-11-03T09:08:54.419+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Notes'/><title type='text'>Technology and society</title><content type='html'>Two interesting articles on technology and its impact on society.  I received the first one from Ergun Bicici.  The second one came from Emrah Çevik.&lt;br /&gt;&lt;br /&gt;&lt;a href="http://www.norvig.com/speech.html"&gt;U.C. Berkeley Graduation Speech&lt;/a&gt; by Peter Norvig&lt;br /&gt;&lt;a href="http://www.econ.yale.edu/smith/econ116a/keynes1.pdf"&gt;Economic Possibilities for our Grandchildren&lt;/a&gt; by JM Keynes (1930)&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-4271537860009173659?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='related' href='http://tech.groups.yahoo.com/group/ariteknokent/message/874' title='Technology and society'/><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/4271537860009173659/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=4271537860009173659' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/4271537860009173659'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/4271537860009173659'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2007/03/technology-and-society.html' title='Technology and society'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-2186220864679498595</id><published>2007-02-22T13:26:00.001+02:00</published><updated>2010-11-03T09:08:54.419+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Türkçe'/><title type='text'>Bilinç üzerine</title><content type='html'>Kavram uzayımızda "göründüğü gibi olmama" ödülleri dağıtılsa&lt;br /&gt;herhalde "bilinç" birincilik ödülüne layık görülürdü.&lt;br /&gt;&lt;br /&gt;İnsanın aklına gelen "peki bilgileri algılayan kim" sorusu belki&lt;br /&gt;bilinci anlama konusunda bizi en çok uğraştıran soru.  İçeride&lt;br /&gt;algıları izleyen bir küçük bilinç (homunculus) varsaymak yetmiyor&lt;br /&gt;çünkü o zaman aynı soruyu küçük bilinç için sormak gerek.  Kaldı&lt;br /&gt;ki beyin hakkında bildiklerimiz algıların yakınsandığı bir merkez&lt;br /&gt;fikrini pek desteklemiyor. Desteklese bile bu tatmin edici&lt;br /&gt;olmazdı: beyninizi açıp, işte şuradaki nöron kümesi senin bilincin&lt;br /&gt;deseler bu size çok şey ifade etmeyecek. Örneğin şu an dinlediğim&lt;br /&gt;müziğin işitme merkezimde bir seri nöron tıklaması olarak kod&lt;br /&gt;edildiğini biliyorum ama bu duyduğum müziğin öznel kalitesini&lt;br /&gt;açıklamaya yetmiyor. &lt;span class="fullpost"&gt;&lt;br /&gt;&lt;br /&gt;Bir alternatif bilinci beynin dışında aramak. Tüm büyük dinler&lt;br /&gt;(Budizm dışında sanırım) manevi bir dünyanın gerçekliğini kabul&lt;br /&gt;ediyor. Bu dualist pozisyon Descartes'tan beri pek çok ciddi&lt;br /&gt;filozof tarafından da savunuldu. Başka konularda düşüncelerine çok&lt;br /&gt;saygı duyduğum Popper bu akımın en son örneklerinden biri&lt;br /&gt;[3]. Buradaki en büyük problem artık her ciddi filozofun enerjinin&lt;br /&gt;korunumu gibi fiziksel kanunlara inanması ve bu da fizik dışında&lt;br /&gt;bir olgunun fiziksel cisimleri (elimiz kolumuz gibi) etkilemesini&lt;br /&gt;imkansız hale getiriyor.&lt;br /&gt;&lt;br /&gt;Tabi bu "bilinç" meteryali her ne ise illa fizik dışında olmak&lt;br /&gt;zorunda değil. Dualist pozisyonu korumak için fiziği genişletmeye&lt;br /&gt;ve fizik içerisinde bilinç için yer açmaya çalışanlar da var. Bu&lt;br /&gt;forumda da bahsi geçti daha evvel, örneğin Roger Penrose [4].&lt;br /&gt;&lt;br /&gt;Bugünkü bilgimiz ile en tutarlı pozisyon ise bilinç için yeni bir&lt;br /&gt;meteryal aramak yerine bilincin beyindeki işlemlerden ibaret&lt;br /&gt;olduğunu kabul etmek olurdu. Malesef bu meteryalist yaklaşımın&lt;br /&gt;önündeki en büyük psikolojik engel başta sorduğumuz soru: "peki&lt;br /&gt;bilgileri algılayan kim"...&lt;br /&gt;&lt;br /&gt;Bu konuda yazılmış çizilmiş çok şey var. Özellikle [1], [2] ve&lt;br /&gt;[5]'i tavsiye ederim. Bilincin bir ilüzyon olduğuna insanın&lt;br /&gt;kendini dürüstçe inandırabilmesi kolay değil (bu arada ilüzyon&lt;br /&gt;"yok", "yalan" anlamında değil, sadece "aslı göründüğü gibi değil"&lt;br /&gt;anlamında).&lt;br /&gt;&lt;br /&gt;Başka konularda ilüzyonları yıkmak bir nebze daha kolay. Örneğin&lt;br /&gt;şu an görme duyum bana masami, bilgisayarımı, telefonumu, rafları,&lt;br /&gt;kısaca odamın bu köşesindeki hemen her şeyi rahatça algıladığımı&lt;br /&gt;söylüyor.  Fakat basit bir deneyle aslında başparmağımın tırnağı&lt;br /&gt;kadar bir alan dışında hemen hemen kör olduğumu görmem&lt;br /&gt;kolay. Yemekte pilav mercimek vs yediğiniz bir gün, tabağınızda&lt;br /&gt;son kalan tanecikleri tek gözünüzü kapatıp diğerini belli bir&lt;br /&gt;noktaya sabitleyerek saymaya çalışın - ne kadar imkansız olduğunu&lt;br /&gt;göreceksiniz. Ya da eğer partneriniz razı olursa deneyin:&lt;br /&gt;bacağınızın arkasına tek bir sivri cisimle mi dokunulduğunu yoksa&lt;br /&gt;aynı anda aralarında 2-3 cm olan iki farklı cisimle mi&lt;br /&gt;dokunulduğunu ayırdedemeyeceksiniz.&lt;br /&gt;&lt;br /&gt;Eğer Memduh haklı ise ve bilinç de zamanı algılamamızı sağlayan&lt;br /&gt;bir duyu oranıysa belki yukarıdakilere benzer ilüzyon yıkıcı&lt;br /&gt;testler tasarlayabiliriz, ne dersiniz?&lt;br /&gt;&lt;br /&gt;Susan Blackmore meditasyon tavsiye ediyor. Benim için dönüm&lt;br /&gt;noktası bir gün bana tüm dürüstlüğü ile "evet ben bilinçliyim"&lt;br /&gt;diyecek bir robot yapabileceğime inanmam oldu. Etrafında olup&lt;br /&gt;bitenleri algılayıp hatırlayabilen; başka canlıları, onların niyet&lt;br /&gt;ve isteklerini modelleyebilen; neler yaptıklarını hatırlayıp neler&lt;br /&gt;yapacaklarını tahmin edebilen bir robot, bu analiz gücünü kendi&lt;br /&gt;üzerine çevirdiğinde ne görür tahmin edersiniz? Acaba onu mekanik&lt;br /&gt;bir robot olduğuna inandırabilir miyiz kolay kolay?&lt;br /&gt;&lt;br /&gt;[1] Consciousness, A Very Short Introduction -- Susan Blackmore&lt;br /&gt;[2] Consciousness Explained -- Daniel C. Dennett&lt;br /&gt;[3] The Self and Its Brain -- Karl Popper and John Eccles&lt;br /&gt;[4] The Emperor's New Mind -- Roger Penrose&lt;br /&gt;[5] Mind's I -- Douglas Hofstadter and Daniel Dennett&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-2186220864679498595?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='related' href='http://tech.groups.yahoo.com/group/ariteknokent/message/857' title='Bilinç üzerine'/><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/2186220864679498595/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=2186220864679498595' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/2186220864679498595'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/2186220864679498595'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2007/02/bilinc-uzerine.html' title='Bilinç üzerine'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-4464373199364991321</id><published>2007-02-21T13:18:00.001+02:00</published><updated>2010-11-03T09:08:54.420+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Türkçe'/><title type='text'>Akıllı Tasarım</title><content type='html'>Bugüne Berkin'in gönderdiği Türkiye'nin ilk "Akıllı Tasarım"&lt;br /&gt;(intelligent design) konferansı ile ilgili mesajı okuyarak&lt;br /&gt;başladım.&lt;br /&gt;&lt;br /&gt;&lt;a href="http://www.mustafaakyol.org/archives/2007/02/post_1.php"&gt;http://www.mustafaakyol.org/archives/2007/02/post_1.php&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;İstanbul Büyük Şehir Belediyesini tebrik ediyorum. Duyduğum&lt;br /&gt;tepkinin kaynağı akıllı tasarım fikrinin yanlış olduğunu&lt;br /&gt;düşünmemden ziyade, bu tip anti-bilim propogandayı bile&lt;br /&gt;Amerika'nın "bible belt"'inden ithal ediyor olmamız. Bir otomobil&lt;br /&gt;tasarımı yaparken orijinal olamamamızı anlıyorum, McDonalds ve El&lt;br /&gt;Torito'yu İstanbul'a getirmemizi anlıyorum, ama sofu fikirler&lt;br /&gt;üretirken lütfen Amerika'nın yardımına ihtiyacımız olduğunu&lt;br /&gt;söylemesin kimse. Bu işte onlardan yüzlerce yıl daha tecrübeliyiz,&lt;br /&gt;daha yaratıcı olabiliriz... &lt;span class="fullpost"&gt;&lt;br /&gt;&lt;br /&gt;Son fiziksel toplantımızda Memduh tüm bulguları kabul edip açık&lt;br /&gt;fikirli olmasına rağmen evrime inanmakta güçlük çeken birine nasıl&lt;br /&gt;bir argüman sunulabileceğini sorgulamıştı. Bu konuda okuduğum&lt;br /&gt;kitaplar malesef çoğunlukla karşı tarafın straw-man argümanlarıyla&lt;br /&gt;uğraşıyor.  Bir yandan aklımı bu kurcalamakta.&lt;br /&gt;&lt;br /&gt;Tabi bu arada ders saatim geldi 60 mühendislik öğrencisine&lt;br /&gt;olasılık öğretmek için (günün konusu permutasyon, kombinasyon)&lt;br /&gt;girdim sınıfa.  Tahmin edebileceğiniz gibi dersin yarısından&lt;br /&gt;fazlası evrim konusundaki örneklere gitti :)&lt;br /&gt;&lt;br /&gt;Şu maymunların rastgele daktilo tuşlarına basarak Shakespeare&lt;br /&gt;yazma örneğini ele alalım. Genelde canlılarda görülen karmaşık&lt;br /&gt;yapıların (göz, beyin, karaciğer) "rastgele" ortaya çıkması bizim&lt;br /&gt;maymunların edebi yetenekleriyle karşılaştırılır. Bu argümandaki&lt;br /&gt;ilk zayıf noktayı öğrenciler hemen buldu: dünyada çok sayıda canlı&lt;br /&gt;çok uzun zamandır evrim oyununu oynamaktaydı bu iş bir iki&lt;br /&gt;maymunun işi gibi değildi. Fakat bu zayıf nokta çökertici&lt;br /&gt;değil. 100,000 harflik tipik bir Shakespeare oyununu rastgele&lt;br /&gt;yazma ihtimali 30^-100,000 gibi bir sayı ki evrenin tüm&lt;br /&gt;parçacıkları big-bang'den beri bu işle uğraşıyor da olsa&lt;br /&gt;yetmiyor. Zayıf nokta maymunların rastgele tuşlara bastığı&lt;br /&gt;varsayımı. Evrimde yanlış tuşa basan maymun vefat ediyor, yazdığı&lt;br /&gt;harf siliniyor, yerine başka maymun geçiyor. Ta ki doğru tuşa&lt;br /&gt;basana kadar. Bu şekilde her harf için ortalama 30 maymun telef&lt;br /&gt;etsek, 3 milyon maymun bir Shakespeare oyununu rahat yazar. Bu&lt;br /&gt;kadar maymun İstanbul'da bile rahatça bulunur.&lt;br /&gt;&lt;br /&gt;Fakat tüm bunlar boş. Esas nokta bu değil. Evrime inanmak&lt;br /&gt;istemeyen bir insan bence ikna edilemez. Daha basit bir örnek&lt;br /&gt;vereyim. Eğer ben evrenin bir gün önce başladığına inanmak&lt;br /&gt;istiyorsam kimse beni bunun aksine ikna edemez. Hatıralarım mı?&lt;br /&gt;Ben beynimde o hafızalarla üretildim. Dünyadaki fosiller mi? O&lt;br /&gt;kayalar da o şekilde üretildi.  Karbon tarihleme mi? Dün bu işleri&lt;br /&gt;yapan zat o cisimlerdeki C-12 ve C-14 oranını o şekilde&lt;br /&gt;ayarladı. Söyleyebileceğiniz hiçbir şey evrenin dünden önce var&lt;br /&gt;olmadığını ya da yarından sonra var olmayacağını ispatlayamaz bana&lt;br /&gt;- çünkü mantıksal bir delik yok varsayımımda.&lt;br /&gt;&lt;br /&gt;Peki o zaman neden olası varsayımlardan biri (evren 14 milyar yıl&lt;br /&gt;önce big-bang ile başladı vs vs) bana diğer varsayımlardan daha&lt;br /&gt;çekici geliyor? Geçenlerde bu forumda bilim nedir ne değildir&lt;br /&gt;(yanlışlanabilirlik vs) konusunu tartıştık. Fakat&lt;br /&gt;yanlışlanabilirlikten kime ne? Benim ne zorum var her söylediğim&lt;br /&gt;yanlışlanabilir olsun diye yırtınayım? Niye bu düşünce sistemini&lt;br /&gt;diğer düşünce sistemlerine tercih ediyorum?&lt;br /&gt;&lt;br /&gt;Sonunda iki sebebi olduğuna karar verdim.&lt;br /&gt;&lt;br /&gt;Birincisi merak ve estetik. Küçüklükten beri ne nasıl çalışır&lt;br /&gt;merak ediyorum. Canlıları cansızlardan ayıran nedir, dinazorlar&lt;br /&gt;nereden geldi, nereye gittiler, genler nasıl çalışır, hücrelerin&lt;br /&gt;içinde ne olup bitiyor, tüm bileşenlerini bir araya getirsek canlı&lt;br /&gt;hücre yapabilir miyiz - bunların hepsi birbirine bağlı kafamda&lt;br /&gt;dönen sorular. Birilerinin gelip bana böyle oldu çünkü akıllı biri&lt;br /&gt;böyle tasarladı demesi, "bu soruyu sorma" demesinden farklı değil&lt;br /&gt;- dolayısıyla merakımı tatmin etmiyor - ayrıca fikir olarak bir&lt;br /&gt;gecekondu mahallesi kadar estetiği yok.&lt;br /&gt;&lt;br /&gt;İkinci sebep fayda. Bilimsel yöntemi kullanarak problemleri&lt;br /&gt;çözebileceğimize, hastalıkları yenebileceğimize, doğayı daha iyi&lt;br /&gt;kontrol edebileceğimize vs. inanıyorum. Hatta bu fikir bana evren&lt;br /&gt;dünden önce var olmasaydı bile cazip geliyor. Sadece evrenin&lt;br /&gt;yarından sonra da var olacağına inanmam merak etmem ve&lt;br /&gt;problemlerle uğraşmam için yeterli. Gerçi yarın evren sona erecek&lt;br /&gt;olsa da matematikle uğraşırdım herhalde.&lt;br /&gt;&lt;br /&gt;İşin ilginç tarafı eğer bu son söylediğim doğru ise bilime ve&lt;br /&gt;dolayısıyla evrime inanmayanların teknolojide, tıpta geri kalması&lt;br /&gt;ve evrim kanunlarına göre elimine olmaları beklenebilirdi. Tabi&lt;br /&gt;kurduğumuz sosyal düzen insanların bilimsel inançlarıyla çocuk&lt;br /&gt;yapma sayılarını pek correlated tutmuyor. İronik olurdu öyle&lt;br /&gt;olsa. Sanki evrime inanmayanları evrim cezalandırıyormuş gibi...&lt;br /&gt;&lt;br /&gt;Tabi bu da akıllı tasarımcılara karşı ikna edici bir argüman&lt;br /&gt;değil.  Çünkü onlar da benim gibilerin pek yakında kutsal bir&lt;br /&gt;felaket sonucu ortadan kaldırılacağına inanıyorlar :)&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-4464373199364991321?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='related' href='http://tech.groups.yahoo.com/group/ariteknokent/message/853' title='Akıllı Tasarım'/><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/4464373199364991321/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=4464373199364991321' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/4464373199364991321'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/4464373199364991321'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2007/02/akll-tasarm.html' title='Akıllı Tasarım'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-8133897832031267484</id><published>2007-02-21T10:44:00.000+02:00</published><updated>2010-11-03T09:08:54.421+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Books'/><title type='text'>Reckoning with Risk by Gerd Gigerenzer</title><content type='html'>This is the last one of the new breed of "innumeracy" books I have read.  Not one of the better ones.  Still ok for generating examples for my probability course.  Other examples of the genre include:&lt;span class="fullpost"&gt;&lt;br /&gt;&lt;br /&gt;- Innumeracy by Paulos&lt;br /&gt;- A mathematician reads the newspaper by Paulos&lt;br /&gt;- How to lie with statistics by Huff&lt;br /&gt;&lt;br /&gt;Not in the same genre but the following books on the history of probability and statistics may also be useful for stories and examples:&lt;br /&gt;&lt;br /&gt;- Games Gods and Gambling by David&lt;br /&gt;- The Probabilistic Revolution by Kruger et.al.&lt;br /&gt;- The History of Statistics by Stigler.&lt;br /&gt;- The Lady Tasting Tea by Salsburg.&lt;br /&gt;&lt;br /&gt;The website&lt;br /&gt;&lt;a href="http://www.planetqhe.com/beta/information/home.htm"&gt;http://www.planetqhe.com/beta/information/home.htm&lt;/a&gt; contains nice problems and animations as well as some book recommendations and links.&lt;br /&gt;&lt;br /&gt;- The Magical Maze by Ian Stewart&lt;br /&gt;- Chance Rules by Brian Everitt&lt;br /&gt;- Inevitable Illusions by Massimo Piattelli-Palmarini&lt;br /&gt;- Taking Chances by John Haigh&lt;br /&gt;- Fifty Challenging Problems in Probability by Frederick Mosteller&lt;br /&gt;- Cartoon Guide to Statistics by Gonick and Smith&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;a style="font-family: arial;" href="http://www.amazon.com/exec/obidos/search-handle-url/002-3023496-0112000?%5Fencoding=UTF8&amp;search-type=ss&amp;amp;index=books&amp;amp;field-author=Massimo%20Piattelli-Palmarini"&gt;&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-8133897832031267484?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/8133897832031267484/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=8133897832031267484' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/8133897832031267484'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/8133897832031267484'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2007/02/reckoning-with-risk-by-gerd-gigerenzer.html' title='Reckoning with Risk by Gerd Gigerenzer'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-2788873710822650264</id><published>2007-02-14T13:08:00.001+02:00</published><updated>2010-11-03T09:08:54.422+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Türkçe'/><title type='text'>Sigara ve özgürlük</title><content type='html'>Amartya Sen'in &lt;a href="http://tech.groups.yahoo.com/group/ariteknokent/message/823"&gt;Unrestrained smoking is a libertarian half-way house&lt;/a&gt; makalesi üzerine düşünceler... &lt;span class="fullpost"&gt;&lt;br /&gt;&lt;br /&gt;Libertarian söylem içinde delik bulacağım derken, Amartya Sen'in&lt;br /&gt;argümanlarında içinden tren geçirilecek boşluklar bırakması ilginç&lt;br /&gt;tabi. Bu boşluklardan biri Emre'nin bahsettiği Platon limitinden&lt;br /&gt;bahsedilmeden geçmiş gelecek ben çatışmasına devletin burnunun&lt;br /&gt;sokulması, bir diğeri de herkesin sosyal yardımı bir takım&lt;br /&gt;özgürlüklerin kısıtlanmasına tercih edeceğinin tartışmasız&lt;br /&gt;varsayılması (hatta alternatif "monstrously unforgiving society"&lt;br /&gt;olarak betimlenip bu konuda çözüm düşünmek isteyebilecekler baştan&lt;br /&gt;damgalanmış).&lt;br /&gt;&lt;br /&gt;Devlet eğer sosyal bir anlaşmanın ürünü olacaksa beni neden&lt;br /&gt;koruyup neden kormayacağına da ben karar verebilmeliyim. Korunmak&lt;br /&gt;istediğim ölçüde özgürlüklerimin kısıtlanmasına da&lt;br /&gt;katlanmalıyım. Örneğin beni kimsenin öldürmesini istemiyorsam&lt;br /&gt;devleti de beni koruması için görevlendirdiysem, benim de&lt;br /&gt;başkalarını öldürme özgürlüğümün elimden alınmasına ses çıkarmamam&lt;br /&gt;gerek. (Tabi farklı insanlar farklı şeylerden korunmak&lt;br /&gt;isteyecekler, durum karışacak vs.)&lt;br /&gt;&lt;br /&gt;Peki gerçekten gelecek benleri bugünkü benden korumasını istiyor&lt;br /&gt;muyum ben üçüncü bir kişinin? Bu bana bir kabus gibi&lt;br /&gt;geliyor. Bugünkü ben geçmiş beni yaptığı bütün hatalarla (belki&lt;br /&gt;özellikle hatalarla) kabul edip seviyor. Birileri bana onu yapma&lt;br /&gt;bunu yapma deyip beni "korusaydı" karşınızda bu Deniz olmayacaktı&lt;br /&gt;(ana babama sorun, çok denediler, dinletemediler). Farklı&lt;br /&gt;ben'lerin farklı bireyler olarak kabul edilmesine bir itirazım&lt;br /&gt;yok, ama lütfen aramızdaki anlaşmazlıklara başkası&lt;br /&gt;karışmasın. Hatta biraz daha ileri gidersek, ben yakın ailem ve&lt;br /&gt;yakın arkadaşlarım arasındaki anlaşmazlıklara da devletin&lt;br /&gt;karışmasını tercih etmem, en azından o durumlarda uygulanan&lt;br /&gt;kurallarla hiç tanımadığım insanlarla aramdaki ilişkilere&lt;br /&gt;uygulanan kuralların aynı olması bana saçma geliyor.&lt;br /&gt;&lt;br /&gt;Sosyal konularda ne zaman basit prensiplere (aksiyomlara) dayalı&lt;br /&gt;çözümler üzerinde düşünmek istesem, bu konulara benden çok kafa&lt;br /&gt;yoran bazı tanıdıklarım konunun böyle basit olmadığını, insanların&lt;br /&gt;karmaşık olduğunu, benim basit teknik kafamla düşünüp önerdiğim&lt;br /&gt;çözümlerdeki delikleri işaret ederek gösterir, naivete'me&lt;br /&gt;gülerler. Ben yine de sosyal konularda basit prensiplerden&lt;br /&gt;çıkarılan ideallere aşımtotik olarak yaklaşmaya çalışmanın, fuzzy&lt;br /&gt;güt feeling'lerle prensipsiz bir çözüm karmaşası üretmekten iyi&lt;br /&gt;olduğu düşüncesindeyim. Doğru çözümlere bu şekilde daha kolay&lt;br /&gt;ulaşabileceğimizi ispatlayamasam bile en azından neyin doğru neyin&lt;br /&gt;yanlış olduğunu tartışabileceğimiz bir asgari müştereğimiz olur.&lt;br /&gt;&lt;br /&gt;John Stuart Mill'in düşünceleri, ve Libertarianism benim bu&lt;br /&gt;anlamda anlayabildiğim idealler. (Libertarian'lar sosyal yardım&lt;br /&gt;olmasın demiyor bu arada, zorla olmasın diyorlar sadece) Pratikte&lt;br /&gt;sorunları çıksa da oturup düşünebiliyorum nasıl bu ideallerle&lt;br /&gt;consistent başka bir çözüm olabilir diye. Birisi Amartya Sen'in&lt;br /&gt;aksiyomlarını bana anlatabilir mi?&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-2788873710822650264?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='related' href='http://tech.groups.yahoo.com/group/ariteknokent/message/832' title='Sigara ve özgürlük'/><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/2788873710822650264/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=2788873710822650264' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/2788873710822650264'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/2788873710822650264'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2007/02/sigara-ve-ozgurluk.html' title='Sigara ve özgürlük'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-3614351480156861993</id><published>2007-02-12T12:53:00.001+02:00</published><updated>2010-11-03T09:08:54.423+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Türkçe'/><title type='text'>Permutation City - Greg Egan</title><content type='html'>Değişik "varoluş" çeşitlerinin virtiyozu Greg Egan'dan Permutation&lt;br /&gt;City adında bir sci-fi hikaye okudum. Aslında Egan'a science&lt;br /&gt;fiction demek haksızlık, philosophical fiction demek&lt;br /&gt;lazım. Varoluş çeşitleri derken aklınıza gelebilecek biyolojik,&lt;br /&gt;sentetik, yazılımla simüle edilmiş, ya da simüle edilen bir&lt;br /&gt;evrenin içinde evrimleşmiş her türlü bilinç ve bunların&lt;br /&gt;aralarındaki etkileşimler, kopyalamalar, zamanı algılamalarındaki&lt;br /&gt;görece farklar doyasıya işlenmiş. İlgi çekebilecek birkaç konu:&lt;br /&gt;1000 yaşına gelen insanlar long-term hafızalarındaki yavaşlamayla&lt;br /&gt;nasıl başa çıkarlar, sizden 17 kat yavaş run edebilen&lt;br /&gt;simülasyonunuzla nasıl iletişim kurarsınız, siz simülasyonu geriye&lt;br /&gt;doğru çalıştırdığınızda simülasyondaki bilinçlerin zaman kavramına&lt;br /&gt;ne olur, peki simülasyonu durdurduğunuzda o bilinçler için zaman&lt;br /&gt;durur mu?&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-3614351480156861993?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='related' href='http://tech.groups.yahoo.com/group/ariteknokent/message/820' title='Permutation City - Greg Egan'/><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/3614351480156861993/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=3614351480156861993' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/3614351480156861993'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/3614351480156861993'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2007/02/permutation-city-greg-egan.html' title='Permutation City - Greg Egan'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-516881834920692914</id><published>2007-02-12T02:07:00.000+02:00</published><updated>2010-11-03T09:08:54.424+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Türkçe'/><title type='text'>Fikir incileri</title><content type='html'>Her konunun (okulda nefret ettiğim konuların bile) sahip olduğu&lt;br /&gt;fikir incileri olduğuna inanıyorum.  Okul hayatımızda iç bayıcı&lt;br /&gt;mantıksal sıra yerine bu incilerle beslesek çocukları eminim pek&lt;br /&gt;çok konuda daha meraklı olacaklar.  Tabi hangi konunun derinine&lt;br /&gt;inmek istese incilerle dolu kıyıyı aşıp güç bela yürünen&lt;br /&gt;bataklıklarla uğraşması gerekiyor insanın. Ama bir kere incilerin&lt;br /&gt;tadını almış olan insanlar için bu daha dayanılır bir işkence. &lt;span class="fullpost"&gt;&lt;br /&gt;&lt;br /&gt;Ortaokulda arkadaşım Ünal, abisinden öğrendiği Pisagor teoreminden&lt;br /&gt;bahsetmişti bigün otobüste giderken. İlk tepkim "hadi canım" oldu.&lt;br /&gt;Matematikçiler her dik üçgen için a^2 + b^2 = c^2 olduğunu nasıl&lt;br /&gt;bilebilirlerdi? Bir kere her üçgeni çizip ölçmeleri gerekirdi ki&lt;br /&gt;bu da sonsuza kadar sürerdi. Üstelik bir sonraki üçgenin kurala&lt;br /&gt;uymayacağını nasıl bilebilirdi kimse? Yani kısacası inanmadım.&lt;br /&gt;Sonra tam olarak ne zaman bilmiyorum ama matematikçilerin sonsuz&lt;br /&gt;sayıda cisimle ilgili kesin iddialarda bulunabildiklerini anladım.&lt;br /&gt;Çok daha sonra da aslında matematiği diğer uğraşılardan ayıran&lt;br /&gt;temel özelliğin bu olduğunu ve başka hiçbir şeyden emin&lt;br /&gt;olmadığımızı...&lt;br /&gt;&lt;br /&gt;Neden anlatıyorum bu hikayeyi? Çünkü o günden beri benim "inci"&lt;br /&gt;olarak nitelendirdiğim fikirlerin özelliklerini çok güzel&lt;br /&gt;örneklendiriyor. Bir inci ilk duyulduğunda yarattığı tepki "hadi&lt;br /&gt;canım" olmalı. Bu his güzel bir sihirbazlık numarası gördüğümde&lt;br /&gt;hissettiğim incredulity'nin aynısı. Te sonra bir zaman "Aha"&lt;br /&gt;dedirtmeli anlayınca. Bilim ve matematikte doğru ve faydalı&lt;br /&gt;fikirler çok, ama aralarında insana önce "hadi canım" sonra "aha"&lt;br /&gt;dedirtenler az.&lt;br /&gt;&lt;br /&gt;Bu açıdan bakınca bazı sorulara kesin cevaplar verebilmemiz de&lt;br /&gt;prensipte cevaplanamayacak sorular olması da bana ilginç geliyor.&lt;br /&gt;Bilincin, özgür iradenin, ve zaman akışının birer ilüzyon olabilme&lt;br /&gt;olasılığı ilginç geliyor. Elektronların izlenip izlenmediklerine&lt;br /&gt;göre bir o delikten bir bu delikten geçmeleri ilginç geliyor.&lt;br /&gt;Dünyadaki elmalarla uzaydaki yıldızların aynı atomlardan oluşup&lt;br /&gt;aynı yerçekimi kurallarına uyması ilginç geliyor. Sonlu sayıda&lt;br /&gt;veriye bakıp, daha önce görmediğimiz şeylerle ilgili tahminlerde&lt;br /&gt;bulunabilmemiz, kısaca "öğrenmenin" ve "bilimin" mümkün oluşu ve&lt;br /&gt;limitlerinin ne olduğu ilginç geliyor. Borsada hayat boyu para&lt;br /&gt;kazanamasam da, para kazanılamayacağını ispatlamanın imkansız&lt;br /&gt;oluşu ya da pokerde her ne kadar hesaplayamasam da hiçbir zaman&lt;br /&gt;para kaybetmeyecek bir stratejinin var oluşu ilginç geliyor. Kedi&lt;br /&gt;yavrularının beyinlerinde görmelerini sağlayan center-surround&lt;br /&gt;hücreler bulunması, daha sonra bunların aslında iki boyutlu türev&lt;br /&gt;aldıklarını anlamamız, daha sonra rastgele bağlanmış bir sınır&lt;br /&gt;ağına rastgele resimler gösterildiğinde hücrelerin basit bir&lt;br /&gt;kuralla kendilerini bu şekilde bağlayabildiklerinin gösterilmesi&lt;br /&gt;bana çok ama çok ilginç geliyor.&lt;br /&gt;&lt;br /&gt;Ve bunların hiçbiri okullarda çocuklara anlatılmıyor! Tüm bu&lt;br /&gt;bahsettiklerimi ilgili bir ortaokul öğrencisine çok rahat&lt;br /&gt;anlatabileceğimi biliyorum. Bunu bildiğim halde Bilim ve Teknik&lt;br /&gt;dergisinin mesajlarıma cevap yazmaması, Tübitak'ın kitap&lt;br /&gt;çevirileri yaparken fikrimizi sormaması, kendi çocuğuma ne&lt;br /&gt;öğretileceğine karar verecek milli eğitim bakanlığının zavallı&lt;br /&gt;insanlar tarafından yönetilmesi, çalıştığım okulun öğrettiklerimin&lt;br /&gt;kalitesiyle de içeriğiyle de pek ilgilenmemesi, ve ıvır zıvır&lt;br /&gt;işlerle uğraşırken zamanın geçip gitmesi beni çileden çıkarıyor.&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-516881834920692914?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='related' href='http://tech.groups.yahoo.com/group/ariteknokent/message/819' title='Fikir incileri'/><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/516881834920692914/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=516881834920692914' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/516881834920692914'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/516881834920692914'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2007/02/fikir-incileri.html' title='Fikir incileri'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-2771949801053235405</id><published>2007-02-07T16:40:00.000+02:00</published><updated>2010-11-03T09:08:54.424+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Books'/><title type='text'>Project Orion by George Dyson</title><content type='html'>"A universal library, whether of books, genotypes, or technologies, forms an expanding cloud of possibilities in a multidimensional space.  The laws of nature form an outermost bound.  A smaller cloud, condensed out of this atmosphere of possibilities, represents the organisms of technologies that can be assembled from available parts.  Finally, a small central core - where we live - represents the books, organisms, or technologies that exist at the present time.  Instead of building outward by small increments, Ted (Taylor) sought to develop Orion the other way around: start with the laws of nature; delineate the bounds first of possibility and then of practicality; finally, trace a path backward to existing technology so as to advance not by increments but by leaps and bounds." -- quote from pp. 98&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-2771949801053235405?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/2771949801053235405/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=2771949801053235405' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/2771949801053235405'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/2771949801053235405'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2007/02/project-orion-by-george-dyson.html' title='Project Orion by George Dyson'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-1471878154069318655</id><published>2007-02-01T14:19:00.000+02:00</published><updated>2010-11-03T22:07:38.028+02:00</updated><title type='text'>About me</title><content type='html'>I am an assistant professor in &lt;a href="http://comp.ku.edu.tr"&gt;Computer Engineering&lt;/a&gt; at &lt;a href="http://www.ku.edu.tr"&gt;Koç University&lt;/a&gt;.  Previously I have spent 12 years at the &lt;a href="http://www.csail.mit.edu"&gt;MIT AI Lab&lt;/a&gt; and co-founded &lt;a href="http://www.inquira.com"&gt;Inquira, Inc.&lt;/a&gt;  My research is in natural language processing and machine learning.&lt;br /&gt;&lt;br /&gt;Currently I am one of the organizers of the &lt;a href="http://aclweb.org/aclwiki/index.php?title=SemEval_Portal"&gt;SemEval3&lt;/a&gt; semantic evaluation exercise, co-chair for the &lt;a href="http://www.acl2011.org"&gt;ACL 2011&lt;/a&gt; semantics area, and an editor for the &lt;a href="http://cljournal.org"&gt;Computational Linguistics Journal&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;I do research with both undergraduate and graduate students with a strong background in mathematics and programming.  If interested, take a look at some of my &lt;a href="http://docs.google.com/View?id=d2jm3f3_41dwpnrn"&gt;project ideas&lt;/a&gt;, &lt;a href="http://denizyuret.blogspot.com/search/label/Publications"&gt;papers&lt;/a&gt;, &lt;a href="http://denizyuret.blogspot.com/2009/01/classes.html"&gt;classes&lt;/a&gt;, and &lt;a href="http://denizyuret.blogspot.com/search/label/Students"&gt;past students&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;For more information please visit my &lt;a href="http://denizyuret.blogspot.com"&gt;blog&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;&lt;hr/&gt;&lt;a href="http://www.ku.edu.tr/main.php?lang=tr"&gt;Koç Üniversitesi&lt;/a&gt; &lt;a href="http://comp.ku.edu.tr"&gt;Bilgisayar Mühendisliği Bölümü&lt;/a&gt;'nde öğretim üyesiyim.  Bundan önce &lt;a href="http://www.csail.mit.edu"&gt;MIT Yapay Zeka Laboratuarı&lt;/a&gt;'nda 12 yıl çalıştım ve &lt;a href="http://www.inquira.com"&gt;Inquira, Inc.&lt;/a&gt; şirketini kurdum.  Araştırmalarım doğal dil işleme ve otomatik öğrenme alanlarındadır.&lt;br /&gt;&lt;br /&gt;Bu yıl &lt;a href="http://aclweb.org/aclwiki/index.php?title=SemEval_Portal"&gt;SemEval3&lt;/a&gt; anlamsal analiz yarışmasının organizatörlüğünü, &lt;a href="http://www.acl2011.org"&gt;ACL 2011&lt;/a&gt; konferansı için anlamsal analiz alanında kurul başkanlığını ve &lt;a href="http://cljournal.org"&gt;Computational Linguistics&lt;/a&gt; dergisinin yayın kurulu üyeliğini yürütmekteyim.&lt;br /&gt;&lt;br /&gt;Benimle araştırma yapmak isteyen lisans ve lisansüstü öğrencilere kendilerini matematik, programlama ve İngilizce alanlarında iyi yetiştirmelerini tavsiye ediyorum.  İlgilenirseniz &lt;a href="http://docs.google.com/View?id=d2jm3f3_41dwpnrn"&gt;proje fikirlerime&lt;/a&gt;, &lt;a href="http://denizyuret.blogspot.com/search/label/Publications"&gt;makalelerime&lt;/a&gt;, &lt;a href="http://denizyuret.blogspot.com/search/label/Students"&gt;mezun öğrencilerime&lt;/a&gt; ve &lt;a href="http://denizyuret.blogspot.com/2009/01/classes.html"&gt;verdiğim derslere&lt;/a&gt; bir göz atabilirsiniz.&lt;br /&gt;&lt;br /&gt;Daha fazla bilgi için lütfen &lt;a href="http://denizyuret.blogspot.com"&gt;blog&lt;/a&gt;'umu ziyaret edebilir &lt;a href="http://denizyuret.blogspot.com/search/label/T%C3%BCrk%C3%A7e"&gt;Türkçe yazılarıma&lt;/a&gt; bakabilirsiniz.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-1471878154069318655?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/1471878154069318655/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=1471878154069318655' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/1471878154069318655'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/1471878154069318655'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2007/02/about-me.html' title='About me'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-3547402507652090573</id><published>2007-02-01T10:53:00.006+02:00</published><updated>2011-10-15T18:08:43.806+03:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Classes'/><title type='text'>Classes</title><content type='html'>Fall 2011&lt;br /&gt;&lt;a href="http://courses.ku.edu.tr/comp341"&gt;Comp341&lt;/a&gt; Artificial Intelligence&lt;br /&gt;&lt;br /&gt;Fall 2010&lt;br /&gt;&lt;a href="http://ais.ku.edu.tr/course/15093/Default.html"&gt;Ecoe554&lt;/a&gt; Machine Learning&lt;br /&gt;&lt;br /&gt;Spring 2010&lt;br /&gt;&lt;a href="http://ais.ku.edu.tr/course/14578/Default.html"&gt;Ecoe580&lt;/a&gt; NLP Seminar&lt;br /&gt;&lt;br /&gt;Fall 2009&lt;br /&gt;&lt;a href="http://ais.ku.edu.tr/course/13473/Default.html"&gt;Comp101&lt;/a&gt; Structure and Interpretation of Computer Programs&lt;br /&gt;&lt;a href="http://ais.ku.edu.tr/course/13607/Default.html"&gt;Ecoe554&lt;/a&gt; Machine Learning&lt;br /&gt;&lt;br /&gt;Spring 2009&lt;br /&gt;&lt;a href="http://ais.ku.edu.tr/course/12442/Default.html"&gt;Comp101&lt;/a&gt; Structure and Interpretation of Computer Programs&lt;br /&gt;&lt;a href="http://ais.ku.edu.tr/course/12447/Default.html"&gt;Engr200&lt;/a&gt; Probability and Statistical Methods for Engineers&lt;br /&gt;&lt;a href="http://ais.ku.edu.tr/course/12718/Default.html"&gt;Ecoe580&lt;/a&gt; Semisupervised Learning&lt;br /&gt;&lt;span class="fullpost"&gt;&lt;br /&gt;Fall 2008&lt;br /&gt;&lt;a href="http://ais.ku.edu.tr/course/12007/Default.html"&gt;Comp101&lt;/a&gt; Structure and Interpretation of Computer Programs&lt;br /&gt;&lt;a href="http://ais.ku.edu.tr/course/12079/Default.html"&gt;Ecoe554&lt;/a&gt; Machine Learning&lt;br /&gt;&lt;br /&gt;Spring 2008&lt;br /&gt;&lt;a href="http://ais.ku.edu.tr/course/10764/Default.html"&gt;Comp101&lt;/a&gt; Structure and Interpretation of Computer Programs&lt;br /&gt;&lt;a href="http://ais.ku.edu.tr/course/10721/Default.html"&gt;Engr200&lt;/a&gt; Probability and Statistical Methods for Engineers&lt;br /&gt;&lt;br /&gt;Fall 2007&lt;br /&gt;&lt;a href="http://www.denizyuret.com/kunlp/"&gt;KUNLP&lt;/a&gt; Reading group&lt;br /&gt;&lt;a href="http://ais.ku.edu.tr/course/10328/Default.html"&gt;Comp101&lt;/a&gt; Structure and Interpretation of Computer Programs&lt;br /&gt;&lt;a href="http://ais.ku.edu.tr/course/10332/Default.html"&gt;Comp131&lt;/a&gt; Introduction to Programming&lt;br /&gt;&lt;br /&gt;Spring 2007&lt;br /&gt;&lt;a href="http://ais.ku.edu.tr/course/9651/Default.html"&gt;Comp101&lt;/a&gt; Structure and Interpretation of Computer Programs&lt;br /&gt;&lt;a href="http://ais.ku.edu.tr/course/9638/Default.html"&gt;Engr200&lt;/a&gt; Probability and Statistical Methods for Engineers&lt;br /&gt;&lt;br /&gt;Fall 2006&lt;br /&gt;&lt;a href="http://ais.ku.edu.tr/course/9204/Default.html"&gt;Comp101&lt;/a&gt; Structure and Interpretation of Computer Programs&lt;br /&gt;&lt;br /&gt;Spring 2006&lt;br /&gt;&lt;a href="http://ais.ku.edu.tr/course/8566/Default.html"&gt;Ecoe554&lt;/a&gt; Machine Learning&lt;br /&gt;&lt;br /&gt;Fall 2005&lt;br /&gt;&lt;a href="http://ais.ku.edu.tr/course/7963/Default.html"&gt;Comp101&lt;/a&gt; Structure and Interpretation of Computer Programs&lt;br /&gt;&lt;a href="http://ais.ku.edu.tr/course/7965/Default.html"&gt;Engr200&lt;/a&gt; Probability and Statistical Methods for Engineers&lt;br /&gt;&lt;br /&gt;Spring 2005&lt;br /&gt;&lt;a href="http://ais.ku.edu.tr/course/7317/Default.html"&gt;Comp101&lt;/a&gt; Structure and Interpretation of Computer Programs&lt;br /&gt;&lt;br /&gt;Fall 2004&lt;br /&gt;&lt;a href="http://ais.ku.edu.tr/course/6602/Default.html"&gt;Engr200&lt;/a&gt; Probability and Statistical Methods for Engineers&lt;br /&gt;&lt;a href="http://ais.ku.edu.tr/course/7069/Default.html"&gt;Ecoe554&lt;/a&gt; Machine Learning&lt;br /&gt;&lt;br /&gt;Spring 2004&lt;br /&gt;&lt;a href="http://ais.ku.edu.tr/course/6038/Default.html"&gt;Comp101&lt;/a&gt; Structure and Interpretation of Computer Programs&lt;br /&gt;&lt;br /&gt;Fall 2003&lt;br /&gt;Comp100 Computer Applications&lt;br /&gt;Ecoe554 Machine Learning&lt;br /&gt;&lt;br /&gt;Spring 2003&lt;br /&gt;Math102 Calculus&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-3547402507652090573?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/3547402507652090573/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=3547402507652090573' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/3547402507652090573'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/3547402507652090573'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2009/01/classes.html' title='Classes'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-116947797581360361</id><published>2007-01-22T16:36:00.000+02:00</published><updated>2010-11-03T09:08:54.425+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Links'/><title type='text'>Top science videos</title><content type='html'>I have not been this excited about the internet since 1994, when people at the AI Lab would run mosaic everyday and check to see if there were any new pages.  My new addiction is videos!   You can find quite a number of lectures, interviews etc.  and it somehow makes a big difference to see the person talking, rather than just listening or reading.  Here is a partial list of my favorites.&lt;br /&gt;&lt;ul&gt;&lt;li&gt;&lt;span class="down" style="display: block;" id="formatbar_CreateLink" title="Link" onmouseover="ButtonHoverOn(this);" onmouseout="ButtonHoverOff(this);" onmouseup="" onmousedown="CheckFormatting(event);FormatbarButton('richeditorframe', this, 8);ButtonMouseDown(this);"&gt;&lt;a href="http://www.ted.com/tedtalks"&gt;http://www.ted.com/tedtalks&lt;/a&gt;&lt;/span&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://www.poptech.com/popcasts"&gt;http://www.poptech.com/popcasts&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://www.communistrobot.com"&gt;http://www.communistrobot.com&lt;/a&gt;&lt;br /&gt;&lt;/li&gt;&lt;li&gt;&lt;span class="down" style="display: block;" id="formatbar_CreateLink" title="Link" onmouseover="ButtonHoverOn(this);" onmouseout="ButtonHoverOff(this);" onmouseup="" onmousedown="CheckFormatting(event);FormatbarButton('richeditorframe', this, 8);ButtonMouseDown(this);"&gt;&lt;a href="http://www.edge.org/edge_video.html"&gt;http://www.edge.org/edge_video.html&lt;/a&gt;&lt;/span&gt;&lt;/li&gt;&lt;li&gt;&lt;span class="down" style="display: block;" id="formatbar_CreateLink" title="Link" onmouseover="ButtonHoverOn(this);" onmouseout="ButtonHoverOff(this);" onmouseup="" onmousedown="CheckFormatting(event);FormatbarButton('richeditorframe', this, 8);ButtonMouseDown(this);"&gt;&lt;a href="http://mitworld.mit.edu/index.php"&gt;http://mitworld.mit.edu/index.php&lt;/a&gt;&lt;br /&gt;&lt;/span&gt;&lt;/li&gt;&lt;li&gt;&lt;span class="down" style="display: block;" id="formatbar_CreateLink" title="Link" onmouseover="ButtonHoverOn(this);" onmouseout="ButtonHoverOff(this);" onmouseup="" onmousedown="CheckFormatting(event);FormatbarButton('richeditorframe', this, 8);ButtonMouseDown(this);"&gt;&lt;a href="http://videolectures.net/"&gt;http://videolectures.net&lt;/a&gt;&lt;br /&gt;&lt;/span&gt;&lt;/li&gt;&lt;li&gt;&lt;span class="down" style="display: block;" id="formatbar_CreateLink" title="Link" onmouseover="ButtonHoverOn(this);" onmouseout="ButtonHoverOff(this);" onmouseup="" onmousedown="CheckFormatting(event);FormatbarButton('richeditorframe', this, 8);ButtonMouseDown(this);"&gt;&lt;a href="http://www.singinst.org/media/"&gt;http://www.singinst.org/media&lt;/a&gt;&lt;br /&gt;&lt;/span&gt;&lt;/li&gt;&lt;li&gt;&lt;span class="down" style="display: block;" id="formatbar_CreateLink" title="Link" onmouseover="ButtonHoverOn(this);" onmouseout="ButtonHoverOff(this);" onmouseup="" onmousedown="CheckFormatting(event);FormatbarButton('richeditorframe', this, 8);ButtonMouseDown(this);"&gt;&lt;a href="http://www.longnow.org/projects/seminars/"&gt;http://www.longnow.org/projects/seminars/&lt;/a&gt;&lt;/span&gt;&lt;/li&gt;&lt;li&gt;&lt;span class="down" style="display: block;" id="formatbar_CreateLink" title="Link" onmouseover="ButtonHoverOn(this);" onmouseout="ButtonHoverOff(this);" onmouseup="" onmousedown="CheckFormatting(event);FormatbarButton('richeditorframe', this, 8);ButtonMouseDown(this);"&gt;&lt;a href="http://www.transhumanism.org/tv"&gt;http://www.transhumanism.org/tv&lt;/a&gt;&lt;br /&gt;&lt;/span&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://meaningoflife.tv/"&gt;&lt;span class="down" style="display: block;" id="formatbar_CreateLink" title="Link" onmouseover="ButtonHoverOn(this);" onmouseout="ButtonHoverOff(this);" onmouseup="" onmousedown="CheckFormatting(event);FormatbarButton('richeditorframe', this, 8);ButtonMouseDown(this);"&gt;http://meaningoflife.tv/&lt;/span&gt;&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://www.bbc.co.uk/sn/tvradio/programmes/horizon/broadband/archive/feynman/"&gt;&lt;span class="down" style="display: block;" id="formatbar_CreateLink" title="Link" onmouseover="ButtonHoverOn(this);" onmouseout="ButtonHoverOff(this);" onmouseup="" onmousedown="CheckFormatting(event);FormatbarButton('richeditorframe', this, 8);ButtonMouseDown(this);"&gt;http://www.bbc.co.uk/sn/tvradio/programmes/horizon/broadband/archive/feynman/&lt;/span&gt;&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;span class="down" style="display: block;" id="formatbar_CreateLink" title="Link" onmouseover="ButtonHoverOn(this);" onmouseout="ButtonHoverOff(this);" onmouseup="" onmousedown="CheckFormatting(event);FormatbarButton('richeditorframe', this, 8);ButtonMouseDown(this);"&gt;&lt;a href="http://googleresearch.blogspot.com/2006/12/google-research-picks-for-videos-of.html"&gt;http://googleresearch.blogspot.com/2006/12/google-research-picks-for-videos-of.html&lt;/a&gt;&lt;br /&gt;&lt;/span&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://sapere.alice.it/extra/126/23_settembre.html"&gt;&lt;span class="down" style="display: block;" id="formatbar_CreateLink" title="Link" onmouseover="ButtonHoverOn(this);" onmouseout="ButtonHoverOff(this);" onmouseup="" onmousedown="CheckFormatting(event);FormatbarButton('richeditorframe', this, 8);ButtonMouseDown(this);"&gt;http://sapere.alice.it/extra/126/23_settembre.html&lt;/span&gt;&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://www.thetech.org/genetics/pov_atkins/index.html"&gt;&lt;span class="down" style="display: block;" id="formatbar_CreateLink" title="Link" onmouseover="ButtonHoverOn(this);" onmouseout="ButtonHoverOff(this);" onmouseup="" onmousedown="CheckFormatting(event);FormatbarButton('richeditorframe', this, 8);ButtonMouseDown(this);"&gt;http://www.thetech.org/genetics/pov_atkins/index.html&lt;/span&gt;&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;span class="down" style="display: block;" id="formatbar_CreateLink" title="Link" onmouseover="ButtonHoverOn(this);" onmouseout="ButtonHoverOff(this);" onmouseup="" onmousedown="CheckFormatting(event);FormatbarButton('richeditorframe', this, 8);ButtonMouseDown(this);"&gt;&lt;a href="http://www.thefutureofscience.org/23rd_evolMind.htm"&gt;http://www.thefutureofscience.org/23rd_evolMind.htm&lt;/a&gt;&lt;br /&gt;&lt;/span&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://technetcast.ddj.com/tnc_play_stream.html?stream_id=256"&gt;&lt;span class="down" style="display: block;" id="formatbar_CreateLink" title="Link" onmouseover="ButtonHoverOn(this);" onmouseout="ButtonHoverOff(this);" onmouseup="" onmousedown="CheckFormatting(event);FormatbarButton('richeditorframe', this, 8);ButtonMouseDown(this);"&gt;http://technetcast.ddj.com/tnc_play_stream.html?stream_id=256&lt;/span&gt;&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;span class="down" style="display: block;" id="formatbar_CreateLink" title="Link" onmouseover="ButtonHoverOn(this);" onmouseout="ButtonHoverOff(this);" onmouseup="" onmousedown="CheckFormatting(event);FormatbarButton('richeditorframe', this, 8);ButtonMouseDown(this);"&gt;&lt;a href="http://sss.stanford.edu/coverage/audioandvideo/"&gt;http://sss.stanford.edu/coverage/audioandvideo/&lt;/a&gt;&lt;br /&gt;&lt;/span&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://www.aaai.org/AITopics/html/show.html"&gt;&lt;span class="down" style="display: block;" id="formatbar_CreateLink" title="Link" onmouseover="ButtonHoverOn(this);" onmouseout="ButtonHoverOff(this);" onmouseup="" onmousedown="CheckFormatting(event);FormatbarButton('richeditorframe', this, 8);ButtonMouseDown(this);"&gt;http://www.aaai.org/AITopics/html/show.html&lt;/span&gt;&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;span class="down" style="display: block;" id="formatbar_CreateLink" title="Link" onmouseover="ButtonHoverOn(this);" onmouseout="ButtonHoverOff(this);" onmouseup="" onmousedown="CheckFormatting(event);FormatbarButton('richeditorframe', this, 8);ButtonMouseDown(this);"&gt;&lt;a href="http://web.mit.edu/webcast/csail/2006/"&gt;http://web.mit.edu/webcast/csail/2006/&lt;/a&gt;&lt;br /&gt;&lt;/span&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://www.quiprocone.org/Protected/Lecture1.htm"&gt;&lt;span class="down" style="display: block;" id="formatbar_CreateLink" title="Link" onmouseover="ButtonHoverOn(this);" onmouseout="ButtonHoverOff(this);" onmouseup="" onmousedown="CheckFormatting(event);FormatbarButton('richeditorframe', this, 8);ButtonMouseDown(this);"&gt;http://www.quiprocone.org/Protected/Lecture1.htm&lt;/span&gt;&lt;/a&gt;&lt;/li&gt;&lt;li style="text-align: left;"&gt;&lt;a href="http://swiss.csail.mit.edu/classes/6.001/abelson-sussman-lectures"&gt;&lt;span class="down" style="display: block;" id="formatbar_CreateLink" title="Link" onmouseover="ButtonHoverOn(this);" onmouseout="ButtonHoverOff(this);" onmouseup="" onmousedown="CheckFormatting(event);FormatbarButton('richeditorframe', this, 8);ButtonMouseDown(this);"&gt;http://swiss.csail.mit.edu/classes/6.001/abelson-sussman-lectures&lt;/span&gt;&lt;/a&gt;&lt;/li&gt;&lt;/ul&gt;&lt;span class="down" style="display: block;" id="formatbar_CreateLink" title="Link" onmouseover="ButtonHoverOn(this);" onmouseout="ButtonHoverOff(this);" onmouseup="" onmousedown="CheckFormatting(event);FormatbarButton('richeditorframe', this, 8);ButtonMouseDown(this);"&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-116947797581360361?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/116947797581360361/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=116947797581360361' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/116947797581360361'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/116947797581360361'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2007/01/top-science-videos.html' title='Top science videos'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-8839120873018810375</id><published>2007-01-15T01:51:00.000+02:00</published><updated>2010-11-03T09:08:54.426+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Türkçe'/><title type='text'>İmkansızı başarmak</title><content type='html'>"İmkansızı başarmak" üzere yola çıkan mühendislerin kahramanlık&lt;br /&gt;öykülerini okumayı severim. Richard Rhodes'un "The Making of the&lt;br /&gt;Atomic Bomb", ya da Ben Rich'in "Skunk Works" kitapları bu açıdan&lt;br /&gt;etkileyicidir. Malesef böyle öyküler genelde savaş sırasında ve&lt;br /&gt;büyük devlet projeleri desteğinde ortaya çıkıyor, ama bunu başka&lt;br /&gt;bir mesajda ele alırız. Amacım, her yeni fikrin "acaba dışarıda&lt;br /&gt;böyle bir uygulama var mı" şüpheciliğinden geçirildiği, en büyük&lt;br /&gt;firmaların bile argeye ciddi olarak girmekten korktuğu bu ülkede,&lt;br /&gt;neyin imkanlı neyin imkansız olduğunu ve yeni fikirleri "başkası&lt;br /&gt;yapmış mı"'nın ötesinde nasıl değerlendirebileceğimizi tartışmaya&lt;br /&gt;açmak. Bakarsınız bir gün bir konuda liderliği ele geçiririz, ve&lt;br /&gt;"başkası ne yapmış" diye kopya çekme imkanımız kalmaz ve geçmişe&lt;br /&gt;değil geleceğe bakmak zorunda kalıveririz :) &lt;span class="fullpost"&gt;&lt;br /&gt;&lt;br /&gt;Bu sefer George Dyson'un "Project Orion" kahramanlık öyküsünü&lt;br /&gt;okurken, projedeki mühendislerin düşünce yapısıyla ilgili ilginç&lt;br /&gt;bir paragraf dikkatimi çekti. Önce Borges'in "Evrensel Kütüphane"&lt;br /&gt;fikrini açmalıyım. Değişik yazarların değişik şekillerde formüle&lt;br /&gt;ettiği bu hayali kütüphane, tüm olası kitapların, tüm olası&lt;br /&gt;teknolojilerin, tüm olası organizmaların raflarından birinde yer&lt;br /&gt;aldığı bir olasılıklar evrenidir.&lt;br /&gt;&lt;br /&gt;Dyson proje lideri Ted Taylor hakkında şöyle diyor (parantezler&lt;br /&gt;benim): "Kitaplardan, genotiplerden, ya da teknolojilerden oluşan&lt;br /&gt;bir evrensel kütüphane, çok boyutlu bir uzayda gittikçe genişleyen&lt;br /&gt;bir olasılıklar bulutu gibidir. Doğa kanunları en dış sınırları&lt;br /&gt;oluşturur (hiyerarşi #4). Bu olasılıklar atmosferinin ortasında&lt;br /&gt;yoğunlaşmış daha küçük bir bulut halihazırda elimizde olan&lt;br /&gt;parçalardan üretebileceğimiz organizma ve teknolojileri temsil&lt;br /&gt;eder (hiyerarşi #2). Son olarak merkezdeki küçük bir çekirdek şu&lt;br /&gt;anda var olan kitapları, organizmaları ve teknolojileri temsil&lt;br /&gt;eder (hiyerarşi #1).  Ted, Orion'u çekirdekten dışarı doğru küçük&lt;br /&gt;adımlarla değil diğer yönde geliştirmek istiyordu: doğa&lt;br /&gt;kanunlarıyla başlanacak; önce imkanlılık, daha sonra pratiklik&lt;br /&gt;sınırları çizilecek; son olarak bulunan noktalardan geriye, şu an&lt;br /&gt;elimizde olan teknolojilere doğru bir yol çizilecekti. Ancak böyle&lt;br /&gt;küçük adımlarla değil dev hamlelerle ilerlemek mümkün olabilirdi."&lt;br /&gt;&lt;br /&gt;Bir gün sınırlarımıza doğa kanunlarından başlayarak karar&lt;br /&gt;verebilmek dilegiyle!&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8540876-8839120873018810375?l=denizyuret.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='related' href='http://tech.groups.yahoo.com/group/ariteknokent/message/751' title='İmkansızı başarmak'/><link rel='replies' type='application/atom+xml' href='http://denizyuret.blogspot.com/feeds/8839120873018810375/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8540876&amp;postID=8839120873018810375' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/8839120873018810375'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8540876/posts/default/8839120873018810375'/><link rel='alternate' type='text/html' href='http://denizyuret.blogspot.com/2007/01/imkansz-basarmak.html' title='İmkansızı başarmak'/><author><name>Deniz Yuret</name><uri>http://www.blogger.com/profile/00578023665603100985</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='24' height='32' src='http://ais.ku.edu.tr/etc/iphoto/DYURET.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8540876.post-145294730567548646</id><published>2007-01-07T01:40:00.001+02:00</published><updated>2010-11-03T09:08:54.427+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Türkçe'/><title type='text'>Human computation</title><content type='html'>CMU'dan genç bir asistant prof, bilgisayarların zorlandığı bir&lt;br /&gt;takım problemleri (görüntü tanıma, kelimeleri anlama gibi) insan&lt;br /&gt;beyinlerine çözdürme fikrini anlatıyor aşağıdaki videoda:&lt;br /&gt;&lt;br /&gt;&lt;a href="http://video.google.com/videoplay?docid=-8246463980976635143&amp;amp;q=techtalks"&gt;http://video.google.com/videoplay?docid=-8246463980976635143&amp;amp;q=techtalks&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;Aslinda bu yeni bir fikir değil. MIT'den iki arkadaşım Push Singh,&lt;br /&gt;ve Tim Chklovski web üzerinden gönüllü insanların bilgilerinden&lt;br /&gt;yararlanmak için benzer projeler başlatmışlardı:&lt;br /&gt;&lt;br /&gt;&lt;a href="http://openmind.org/"&gt;http://openmind.org/&lt;/a&gt;&lt;br /&gt;&lt;a href="http://learner.isi.edu/"&gt;http://learner.isi.edu/&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;Beğenir daha isterseniz Google seminerlerinden en iyilerini Peter&lt;br /&gt;Norvig seçip bir top 20 listesi hazırlamış:&lt;br /&gt;&lt;br /&gt;&lt;a href="http://googleresearch.blogspot.com
