Reviews & Opinions
Independent and trusted. Read before buy Theben Aton 3-P!

Theben Aton 3-P


Bookmark
Theben Aton 3-P

Bookmark and Share

 

Theben Aton 3-PAbout Theben Aton 3-P
Here you can find all about Theben Aton 3-P like manual and other informations. For example: review.

Theben Aton 3-P manual (user guide) is ready to download for free.

On the bottom of page users can write a review. If you own a Theben Aton 3-P please write about it to help other people.
[ Report abuse or wrong photo | Share your Theben Aton 3-P photo ]

 

 

Manual

Download (English)

 

Theben Aton 3-P

 

 

User reviews and opinions

<== Click here to post a new opinion, comment, review, etc.

Comments to date: 2. Page 1 of 1. Average Rating:
inan 10:22pm on Saturday, September 11th, 2010 
My Company uses Citrix, so I am able to run Windows Applications, SAP, even flash and all my GO TO corporate applications on the device. Does this device have any real flaws? Lets address some real shortcomings of the iPad. The iPad is exactly what I expected, easy to use, very well executed so long as you understand that it is mainly a device to consume media.
Andy_DKW 6:50pm on Tuesday, July 6th, 2010 
PROS: OS, look, Awesomeness ITs great, and the idea is well along with the OS its a Mac downsized. its size is a bit big Awesome game player, and has replaced my laptop but I do not have to need for business and so I do not know about how those work. Great for traveling,...

Comments posted on www.ps2netdrivers.net are solely the views and opinions of the people posting them and do not necessarily reflect the views or opinions of us.

 

Documents

doc1

TESTING SUPRASEGMENTAL ENGLISH THROUGH PARROTING Joseph Tepperman, Theban Stanley, Kadri Hacioglu, and Bryan Pellom Rosetta Stone Labs Boulder, Colorado, USA
{jtepperman,tstanley,khacioglu,bpellom}@rosettastone.com
ABSTRACT Parroting exercises in a foreign language are designed to make a students speech more native-like through imitation of specic native speech templates. In this paper we describe novel template-based methods for automatically estimating subjective scores for both intonation and rhythm in nonnative English. In terms of accuracy when automatically classifying a parroting speaker as a native or a learner, experimental results show that these new rhythm and intonation scores outperform similar baselines from nonnative speech assessment literature, and that they offer complementary discriminatory information when combined with automatic segment-level pronunciation scores, reaching a maximum classication accuracy of 89.8% on a corpus of parroting exercises. This suggests the general usefulness of these new scores in automatically assessing nonnative pronunciation in a computer-assisted pronunciation practice scenario. Index Terms nonnative speech, pronunciation evaluation, suprasegmental features, second-language acquisition 1. INTRODUCTION Not unlike talking parrots, students of a second language (L2) can learn native-like pronunciation by imitating the speech they hear. Parroting is the attempt on the part of a language learner to reproduce any aurally-transmitted aspect of L2 pronunciation, in an effort to sound more native-like and eliminate the inuence of their own native language (L1). This can encompass pronunciation on multiple linguistic levels - both segmental and suprasegmental imitation. It can also happen implicitly, as in complete immersion in a foreign country, or explicitly, as in a language practice scenario, in response to spoken or recorded prompts designed to elicit repetition. The paradigm of student imitation of audio prompts is pervasive throughout second-language instruction [1, 2]. It has proven to be an effective way to teach phonological, suprasegmental, and conversational elements of an L2. Advanced learners can also benet from parroting exercises, in learning
1 Thanks are due to Daniel Stutzman for implementing the data collection,
to produce subtle contrasts between close phonemes, or acquiring the rhythm and intonation of a second language (a task usually reserved for students who have already mastered basic vocabulary and segment-level pronunciation). This paper will specically address the problem of automatically scoring suprasegmental practice for Japanese learners of English. To assess a students accuracy in parroting an audio prompt, we propose a template-based scoring method. By template-based we mean a direct comparison between any input student utterance and the native speech audio prompt that elicited it (the reference template) by way of some kind of distance metric, but without any model abstraction beyond that. This seemed attractive for evaluating parroting for several reasons. For one, intonation and rhythm are highly variable in English, even among native speakers [3]. The tune to which a phrase is set depends on the speakers semantic intent, and also on their speaking style and emotional content. If a student is encouraged to imitate a specic prompts suprasegmental realization, then that very prompt must serve as a reference. Though this limits the assessment to phrases that have been pre-recorded, the method is inexpensive computationally (it requires no training) and in terms of transcription, requiring no specialized prosodic annotation. In this paper we will describe some novel template-based methods for estimating both intonation and rhythm scores. These proposed methods will be assessed alongside baselines from nonnative speech assessment literature, and all methods will be compared in terms of performance in automatically classifying a parroter as an English native or learner. It is our goal to show improved discrimination over the baselines and to demonstrate that these suprasegmental scores offer complementary information when combined with segment-level scores. The next section describes the corpus of parroting recordings used here. Following that is a description of the proposed methods and baselines. Finally, we will discuss the results of experiments in automatic speech classication. 2. SPEECH CORPUS As mentioned in Section 1, this paper is focused on the task of evaluating suprasegmental parroting produced by Japanese students of English. The speech data used in all the exper-

and to all the Rosetta Stone employees who participated in it.
Japanese Learners Female Speakers Male Speakers Total Phrases Total Hours 1478 1.3

Native English 1592 1.4

Reference Prompts 100 0.1
Table 1. Speech corpus statistics. iments below is divided into three sub-corpora: Japanese learner parroting, Native English speaker parroting, and reference audio prompts. All recordings were made with 16-bit resolution, sampled at 16 kHz, and all speakers parroted some subset of 100 English phrases, hand-selected to represent a variety in length, subject matter, and suprasegmental content. Statistics about these three sets are given in Table 1. The reference prompts were produced by 4 professional voicers (2 male, 2 female) and were previously used as actual prompts in the Rosetta Stone Version 3 American English product. These recordings were processed and edited (including multiband compression and equalization) to maximize intelligibility. In some sense these pronunciations are articial - they are enunciated much more deliberately and dramatically than in ordinary speech or even formal reading. But as prompts they are appropriate, with intonation, rhythm, and phonetic quality unambiguous for purposes of imitation. The learner parroting recordings were collected from native Japanese speakers who also live in Japan. Though none of them were uent in English, the speakers encompassed a wide range of English prociency. All recordings selected for these experiments were checked to ensure that they were at least devoid of noise and that they were in-grammar, i.e. that the student produced all of the words in the prompt. These learners were not explicitly told to parrot the reference prompts suprasegmentally. Even so, the inuence of the prompts intonation and rhythm were palpable, with many learners going as far as to take on the professional voicers emphatic style. To investigate the upper bound on parroting accuracy, we also collected recordings from native speakers of English who are employees of Rosetta Stone. Each speaker was explicitly told to try and match the voicers rhythm and intonation, and they were allowed to listen to the prompt and record their own version as many times as they wanted. None of them were professional voicers. Though these conditions were not the same as in the Japanese collection, this yielded two very different data sets: one with presumably procient native parroting, and one with learner parroting of diverse prociency. 3. METHODS AND METRICS To estimate scores for suprasegmental parroting, we begin by automatically segmenting the students speech into syllables and generating segment-level acoustic scores. With this syllable segmentation we compare the students segmentation to that of the reference prompt, and generate an appropriate

score for rhythm similarity. Then we estimate the students fundamental frequency (f0) contour, and compare it to that of the prompt - an intonation similarity measure is calculated from there. No energy-based scoring is done because the prompt audio was processed such that the energy was more or less uniform throughout, and so it cant serve as a good reference. These steps are explained in detail in this section. 3.1. Speech Segmentation Automatic speech segmentation was done using forced Viterbi decoding of the target utterance using Rosetta Stones proprietary speech recognition system. The segmentation process provided both word-level and phoneme-level alignments of the speech data. The decoded sequence of phonemes was then chunked into syllables based on each words expected syllabication according to a pronunciation dictionary. The decoding grammar allowed for possible word deletion and silence insertion, to be expected in learner speech. Phonemes were assigned a pronunciation score based on a standard likelihood ratio. These scores were aggregated into an overall score for an utterances segmental pronunciation. 3.2. Rhythm Measure To compare the rhythm of the prompt and student utterances, the number of syllables for comparison needed to be identical for both. Due to pronunciation variants in the recognition lexicon, some student phrases may have been decoded with a different number of phonemes or even syllables from the template. For example, the word temperature (which occurred in our corpus) can be pronounced either as /tEmp/ with 3 syllables or as /tEmp@/ with 4. In cases where the number of syllables did not match, we backed off to word-level rhythmic analysis only for the word(s) with the differing number of syllables. Also, to compensate for differences in speaking rate, the templates syllable durations were linearly scaled so that the total duration would match that of the student. The rhythm of speech is characterized not just by the durations of segments but by their contrast - between strong and weak, long and short, stressed and unstressed [3]. A common way of quantifying rhythmic contrast is the Pairwise Variability Index (PVI) [4, 5]; it is essentially the mean difference in duration between pairs of adjacent units (usually syllables) over an utterance. For directly comparing two speakers rhythms, we propose a measure called the Pairwise Variability Error (PVE). Given a sequence of student durations {s1 , s2 ,. , sN } (most of which will represent syllables, but sometimes words if the number of syllables did not match the reference), and a sequence of corresponding reference durations {r1 , r2 ,. , rN }, our rhythm score is dened as PV E =

N i=2 N i=2 min(M,i1) |(si sim ) m=1 min(M,i1) |si sim | m=1
(ri rim )| + |ri rim | (1)
It sums up the difference of differences between pairs of syllables in the student and reference utterances, and then normalizes by the total absolute difference. If the student and reference durations are nearly equal, this score tends toward zero; as their difference approaches innity, this score approaches 1. The m variable is the rhythmic context - an integer m 1 - which allows for comparisons between distant syllables (i.e. M is the maximum distance, in syllables, considered for comparison). This is an enhancement over the PVI which only accounts for adjacent pairs. The idea is that the difference in duration between a pair of distant syllables might be more important, as a perceptual correlate of nativeness, than the difference between a pair of adjacent syllables. 3.3. f0 Contour Processing and Intonation Scoring Fundamental frequency (f0) contours for all utterances were estimated using an autocorrelation-based maximum a posteriori method similar to the one presented in [6]. As with the rhythm score, some adjustments were necessary to ensure that the student and reference f0 contours were directly comparable. This was achieved by warping the reference contour to the students length, phoneme-by-phoneme. Phoneme-level warping and concatenation ensured that the references voiced and unvoiced regions would roughly line up with those of the student. In the event that the number of phonemes did not match, warping backed off to the syllable level (but only for the syllables that did not match) and if the syllables were not the same number, then warping backed off to the word level. This warping was done using linear interpolation of the f0 contours - the template was warped to the students durations and not vice-versa so as not to distort what the student produced. After warping, the similarity of the two contours was calculated as the correlation of the frames that were voiced for both the student and the reference. Correlation is a common measure for f0 contour similarity [7, 8], and was used here because it is insensitive to inter-speaker differences in f0 range; the unvoiced frames were ignored rather than interpolated so as not to put words in the students mouth, so to speak. 3.4. Baselines The standard non-template measures we used for our rhythm baselines are dened identically to those in [4, 5]. They all apply to student speech only, and are dened as follows: the standard deviations of vowel, consonant, and syllable durations (V, C, and S), as well as normalized versions of these (varcoV, varcoC, and varcoS); the rate of speaking (ROS) in phonemes per second; the vowel durations as a percentage of each phrase (%V); and the PVI measures for vowels and consonants (PVI-V and PVI-C). The PVI is the inspiration for the PVE measure in Eqn. 1, and is dened as P V I = N 1

M Accuracy %

1 68.0

2 68.1

3 69.4

4 69.0

Table 2. Performance of the PVE over different M in Eqn. 1. where {s1 ,. , sN } is the sequence of segment durations (either vowels or consonants). Sources such as [4, 5] have shown signicant differences in these metrics between native and nonnative English speakers, as well as between languages hypothesized to have fundamentally different rhythms. Similarly, many studies have used Dynamic Time Warping (DTW) to compare f0 contours in template-based intonation scoring [9, 10]. DTW is a standard algorithm that aligns two time sequences that may differ in length. According to some dened cost function, this alignment returns the minimum cost of warping the two sequences together. Following [9], we constrained our DTW so that phoneme boundaries between the student and reference sequences are required to match - this also makes it comparable to the phoneme-level warping proposed in Section 3.3. Likewise, our cost function is the Euclidean distance between the student and reference f0 values - for normalization, they are divided by their sum and the mean number of frames in the two sequences. 4. EXPERIMENTAL RESULTS To reiterate the goals of this study, we are interested in demonstrating that these new rhythm and intonation scoring methods outperform baselines from the literature in classifying native vs. learner parroting. We also intend to show that these scores can offer improvements in classication when combined with the segment-level condence measures introduced in Section 3.1. Combining the recordings from Japanese learners and native English speakers mentioned in Section 2, classication of all phrases as native or learner was performed using a leaveone-speaker-out crossvalidation procedure. For the individual scores described in Section 3, an optimal threshold for classication at the Equal Error Rate (EER) was found through a stepwise search through every folds training instances. Combinations of scores were also investigated using multiple regression (as in [4]), which solved for the coefcients {a0 , a1 ,. , ac } in a set of equations of the form y = a0 + (a1 x1 ) + (a2 x2 ) +. + (ac xc ) (3)

si si+1 (si + si+1 )/2

where each training phrase is represented by one equation, {x0 , x1 ,. , xc } are the scores to be linearly combined, and y is set to 1 for native speech instances and 0 for learner speech instances. After this linear combination of the scores, the optimal EER threshold was found in the same way as for individual scores. The PVE (dened in Eqn. 1) was assessed for M = {1, 2, 3, 4} - these results are reported in Table 2, and the best value of M was used in all experiments with combinations of features (results reported in Table 3).
rhythm baselines new measures all 80.3 69.4 78.2
intonation 65.9 73.5 78.3
rhythm + intonation 75.0 80.0 81.9
rhythm + intonation + segmental 88.7 89.8 89.9
Table 3. Percent accuracy for native/learner classication. Classifying all as native achieved 51.9%; segmental alone achieved 87.2%. Native listeners should approach 100%. 5. DISCUSSION According to Table 2, the value of M with the highest classication accuracy was M = 3. This indicates that context beyond adjacent syllables (M = 1) is important in determining a score that can most accurately discriminate between native and learner rhythm parroting - results at M = 3 were signicantly better than at M = 1 with p 0.03, using McNemars test. The fall-off in performance at M = 4 suggests that rhythmic contexts beyond 3 syllables are not useful here. The results in Table 3 show that the proposed measures are, in general, improvements over the traditional baselines. The exception is the ensemble of rhythm baseline scores - ten in all, as listed in Section 3.4 - which together achieve 80.3% accuracy over the single new rhythm measures 69.4%. However, the best of the baseline rhythm scores (the ROS) managed only 64.2% accuracy on its own, and the other baseline scores performed below chance levels when used individually. This suggests that the new PVE score is better than any of the baselines alone, and that most of the baselines are illsuited for this task unless used in combination. However, the baseline rhythm scores have the advantage that they do not require a reference prompt for score calculation. With the segment-level score described in Section 3.1, all suprasegmental measures - both baselines and novel metrics - offered complementary information for improved classication. The improvement in performance using the combined new measures over the baselines (the rightmost column of Table 3) was not signicant with McNemars test, indicating that the combined new measures are at least as powerful as the baselines, but maybe not moreso. It is interesting to note that the segment-level score alone achieved 87.2% accuracy compared to the new measures combined 80.0%. This shows that these suprasegmental measures have substantial discriminatory power in comparison to their segmental counterparts, a result rarely seen in related work. Ordinarily, in studies like [11], suprasegmental scores perform at only a fraction of a segmental pronunciation scores accuracy. 6. CONCLUSION For automatically classifying an English speaker as a native or learner, the proposed intonation scoring method outperformed a similar baseline; the new rhythm score outper-

formed each baseline individually, but not when the baselines were combined through multiple regression. In combination with segment-level scores, suprasegmentals offered an improvement over either one alone. We can conclude that the proposed measures are improvements over the baselines in estimating these binary pronunciation scores, and that they capture aspects of pronunciation quality not seen on the segment-level. This suggests they are useful for evaluating parroting. Future studies should show correlation with subjective listener scores, beyond this binary classication. 7. REFERENCES [1] M.G. Bus` , New perspectives in teaching pronuncia ation, in From Didactas to Ecolingua, pp. 165182. 2008. [2] M.W. Tanner and M.M. Landon, The effects of computer-assisted pronunciation readings on ESL learners use of pausing, stress, intonation, and overall comprehensibility, Language Learning & Technology, vol. 13, no. 3, pp. 5165, October 2009. [3] P. Ladefoged, A Course in Phonetics, Thomson, Boston, 5th edition, 2006. [4] T.-Y. Jang, Automatic assessment of non-native prosody using rhythm metrics: Focusing on Korean speakers English pronunciation, in Proc. of the 2nd International Conference on East Asian Linguistics, 2009. [5] P.K. Mok and V. Dellwo, Comparing native and nonnative speech rhythm using acoustic rhythmic measures: Cantonese, Beijing Mandarin and English, in Proc. of Speech Prosody, Campinas, Brazil, 2008. [6] J. Droppo and A. Acero, Maximum a posteriori pitch tracking, in Proc. of ICSLP, 1998. [7] R.A.J. Clark and K.E. Dusterhoff, Objective methods for evaluating synthetic intonation, in Proc. of Eurospeech, 1999. [8] J. Mostow and M. Duong, Automated assessment of oral reading prosody, in Proc. of AIED, 2009. [9] D.T. Chappell and J.H.L. Hansen, Speaker-specic pitch contour modeling and modication, in Proc. of ICASSP, Seattle, 1998. [10] M. Suzuki, T. Konno, A. Ito, and S. Makino, Automatic evaluation system of English prosody based on word importance factor, Journal of Systemics, Cybernetics and Informatics, vol. 6, no. 4, pp. 8390, 2008. [11] C. Teixeira, H. Franco, E. Shriberg, K. Precoda, and K. Sonmez, Prosodic features for automatic textindependent evaluation of degree of nativeness for language learners, in Proc. of ICSLP, 2000.

 

Tags

MZ-N520 RQ 745 MS-2387ARS Edition-2007 700937 EB 261 B2030HD KS-F383R 37LD6600A 0 16V Speed 2100 Booklet KT2255 C725BEE Acoustimass 20 DCR-SR78E ZW4104 IC-V85-T Pilot DX VGN-FE21B E2043F P2470HD 2540CDS VM400 CDI210 BC-137 2800-300 Pagepro 1390 MAH3000 KES2042 H2215 Canon T50 MF2140 ICD-B5 FX-980 TX-32LE7F PM635I Stone GR-DVL167 III Gene M45-S2693 PW1550 ZDF501X Hobby 1122 XEA101 C6-NGT KDC-MP425 Cube-40XL RGX-TT 29DL25ED Acerpower S200 EL-9450 MF4370DN HL-4050CDN TCS SPE Pqf30 14PT1332 EWS 1250 WSA80 Pctv 50I UN65C6500VF 42LT75 NV-DV10000B Stilo EAM3000B NWZ-S639F SD205 HDR-FX7E PMP-100 Vmax-2001 Scenarist Triax 20 CD1351S-22 RC-990BX Gpsmap 78 CT-P9050 TA522 KRF-V7010 Laserjet 4550 TXP46G15E Widl 102 Scubapro-uwatec OXY2 PCR-300 LE46C550j1W KV-25X5K A-RV410 Neve 1081 Logicom G350 PES6000E ZWD14791W RDR-HX1000 RT-17LZ50 SHV 4300 KX-TC1460B Review Mixer RSA1dtpe ET 2400 MC1000 Receiver 7AIV5 E

 

manuel d'instructions, Guide de l'utilisateur | Manual de instrucciones, Instrucciones de uso | Bedienungsanleitung, Bedienungsanleitung | Manual de Instruções, guia do usuário | инструкция | návod na použitie, Užívateľská príručka, návod k použití | bruksanvisningen | instrukcja, podręcznik użytkownika | kullanım kılavuzu, Kullanım | kézikönyv, használati útmutató | manuale di istruzioni, istruzioni d'uso | handleiding, gebruikershandleiding

 

Sitemap

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101