Yazar "Uysal, İbrahim" seçeneğine göre listele
Listeleniyor 1 - 15 / 15
Sayfa Başına Sonuç
Sıralama seçenekleri
Öğe Automated essay scoring effect on test equating errors in mixed-format test(IZZET KARA, 2021) Uysal, İbrahim; Doğan, NuriScoring constructed-response items can be highly difficult, time-consuming, and costly in practice. Improvements in computer technology have enabled automated scoring of constructed-response items. However, the application of automated scoring without an investigation of test equating can lead to serious problems. The goal of this study was to score the constructed-response items in mixed-format tests automatically with different test/training data rates and to investigate the indirect effect of these scores on test equating compared with human raters. Bidirectional long-short term memory (BLSTM) was selected as the automated scoring method for the best performance. During the test equating process, methods based on classical test theory and item response theory were utilized. In most of the equating methods, errors of the equating resulting from automated scoring were close to the errors occurring in equating processes conducted by human raters. It was concluded that automated scoring can be applied because it is convenient in terms of equating.Öğe Çizgi filmlerin ilkokul öğrencilerinin duyuşsal özelliklerine etkisine yönelik bir ölçek geliştirme çalışması(2018) Uysal, İbrahim; Sarıça, SelinÇizgi filmlerin ilkokul öğrencilerinin duyuşsal özellikleri üzerindeki etkileri alanyazında çok fazla araştırılmamış bir konu olduğundan geçerliği ve güvenirliği belirlenmiş herhangi bir ölçme aracına rastlanmamıştır. Bu yönde; araştırmanın amacı, çizgi filmlerin ilkokul öğrencilerinin duyuşsal özellikleri üzerindeki etkilerini belirlemeyi amaçlayan bir ölçek geliştirmektir. Araştırmanın çalışma grubunu, Afyonkarahisar ilinde öğrenim gören 234 ve Aydın ilinde öğrenim gören 220 ilkokul öğrencisi oluşturmaktadır. Ölçek geliştirme aşamasında madde geçerliğine yönelik olarak madde ölçek korelasyonları hesaplanmıştır ve alt grup-üst grup ortalamalarının karşılaştırılmasına dayalı madde analizleri gerçekleştirilmiştir. Ölçeğin yapı geçerliğini belirlemek üzere açımlayıcı faktör analizi ve doğrulayıcı faktör analizi yapılmıştır. Açımlayıcı faktör analizinde faktör çıkarma yöntemi olarak temel bileşenler analizi seçilmiştir. İki boyutlu bir yapıda olduğu gözlenen ölçekte boyutlar arasında ilişki bulunduğundan döndürme yöntemi olarak eğik döndürme yöntemlerinden PROMIN seçilmiştir. Doğrulayıcı faktör analizi yöntemi olarak güçlü ağırlıklı en küçük kareler yöntemi kullanılmıştır. Ölçek güvenirliğine kanıt sağlamak amacıyla Cronbach Alpha, Tabakalı Alfa, Armor Teta, McDonald Omega katsayıları hesaplanmıştır. Araştırma sonucunda üçlü derecelendirilen ve 12 maddeden oluşan ölçekten elde edilen verilerin geçerli ve güvenilir olduğu sonucuna ulaşılmıştır. Buna göre benzer örneklemler için ölçme aracının geçerli ve güvenilir sonuçlar vermesi beklenmektedir.Öğe Comparison of apical microleakage of dual-curing resin cements with fluid-filtration and dye extraction techniques(International Scientific Literature Inc., 2015) Kaya, Sadullah; Özer, Senem Yiğit; Adigüzel, Özkan; Oruçoğlu, Hasan; Değer, Yalçın; Tümen, Emin Caner; Uysal, İbrahimBackground: Endodontically treated teeth with excessive loss of tooth structure are frequently restored using fiber posts. In this in vitro study, the apical leakage of self- and dual-activated curing modes for dual-curing resins cementing a translucent fiber post was evaluated using computerized fluid filtration meter and dye extraction method. Material/Methods: One hundred and four extracted human maxillary incisors with single root and canal were used. Experimental samples embedded in a closed system were divided into 4 groups (n=20) according to 2 dual-curing luting systems, with 2 different curing modes (either with self- or light-activation): (1) Panavia F 2.0 with self-cure, (2) Panavia F 2.0 with light-activation, (3) Clearfill SA with self-cure, and (4) Clearfill SA with light activation. Twenty-four teeth served as negative and positive controls. Translucent fiber posts were luted in the roots except in the control groups. Results: Statistical analysis indicated no significant difference in leakage among groups (p>0.05) with 4.12×10–4 (Panavia self-cure), 4.55×10–4 (Clearfill SA self-cure), 5.17×10–4 (Panavia dual-cure), and 5.59×10–4 (Clearfill SA dual-cure) in fluid-filtration method. Absorbance values for dye-extraction method were 266 nanometer (nm) (Panavia self-cure), 268 nm (Clearfill SA self-cure), 270 nm (Panavia dual-cure), and 271 nm (Clearfill SA dual-cure), in which difference among the groups were not statistically significant (p>0.05). When comparing the leakage, assessment methods results showed no statistically significant difference between the tested evaluation techniques (p>0.05). Conclusions: Light- and self-activation curing modes of Panavia F 2.0 and Clearfill SA perform similar to each other in a closed system. © Med Sci Monit, 2015.Öğe Comparison of confirmatory factor analysis estimation methods on binary data(2020) Kılıç, Abdullah Faruk; Uysal, İbrahim; Atar, BurcuThis Monte Carlo simulation study aimed to investigate confirmatory factor analysis (CFA) estimation methods under different conditions, such as sample size, distribution of indicators, test length, average factor loading, and factor structure. Binary data were generated to compare the performance of maximum likelihood (ML), mean and variance adjusted unweighted least squares (ULSMV), mean and variance adjusted weighted least squares (WLSMV), and Bayesian estimators. As a result of the study, it was revealed that increased average factor loading and sample size had a positive effect on the performance of the estimation methods. According to the research findings, it can be said that the methods are sufficient to estimate average factor loading and interfactor correlations, regardless of the estimation methods, in most of the conditions where the average factor loading is 0.7. In small sample sizes particularly, the interfactor correlation was underestimated for skewed indicator conditions. According to the findings of the study, although there is not the most accurate method in all conditions, it can be recommended to use ULSMV method because it performs adequately in more conditions.Öğe Comparison of factor retention methods on binary data: A simulation study(Mehmet Tekerek, 2019) Kılıç, Abdullah Faruk; Uysal, İbrahimIn this study, the purpose is to compare factor retention methods under simulation conditions. For this purpose, simulations conditions with a number of factors (1, 2 [simple]), sample sizes (250, 1.000, and 3.000), number of items (20, 30), average factor loading (0.50, 0.70), and correlation matrix (Pearson Product Moment [PPM] and Tetrachoric) were investigated. For each condition, 1.000 replications were conducted. Under the scope of this research, performances of the Parallel Analysis, Minimum Average Partial, DETECT, Optimal Coordinate, and Acceleration Factor methods were compared by means of the percentage of correct estimates, and mean difference values. The results of this study indicated that MAP analysis, as applied to both tetrachoric and PPM correlation matrices, demonstrated the best performance. PA showed a good performance with the PPM correlation matrix, however, in smaller samples, the performance of the tetrachoric correlation matrix decreased. The Acceleration Factor method proposed one factor for all simulation conditions. For unidimensional constructs, the DETECT method was affected by both the sample size and average factor loading.Öğe Effect of Item Parameter Drift in Mixed Format Common Items on Test Equating(Ozgen Korkmaz, 2022) Uysal, İbrahim; Şahin-Kürşad, Merve; Kılıç, Abdullah FarukThe aim of the study was to examine the common items in the mixed format (e.g., multiple-choices and essay items) contain parameter drifts in the test equating processes performed with the common item non-equivalent groups design. In this study, which was carried out using Monte Carlo simulation with a fully crossed design, the factors of test length (30 and 50), sample size (1000 and 3000), common item ratio (30 and 40%), ratio of items with item parameter drift (IPD) in common items (20 and 30%), location of common items in tests (at the beginning, randomly distributed, and at the end) and IPD size in multiple-choice items (low [0.2] and high [1.0]) were studied. Four test forms were created, and two test forms do not contain parameter drifts. After the parameter drift was performed on the first of the other two test forms, the parameter drift was again performed on the second test form. Test equating results were compared using the root mean squared error (RMSE) value. As a result of the research, ratio of items with IPD in common items, IPD size in multiple-choice items, common item ratio, sample size and test length on equating errors were found to be significant. © 2022, Ozgen Korkmaz. All rights reserved.Öğe How reliable is it to automatically score open-ended items? An application in the Turkish language(Assoc Measurement & Evaluation Education & Psychology, 2021) Uysal, İbrahim; Doğan, NuriThe use of open-ended items, especially in large-scale tests, created difficulties in scoring open-ended items. However, this problem can be overcome with an approach based on automated scoring of open-ended items. The aim of this study was to examine the reliability of the data obtained by scoring open-ended items automatically. One of the objectives was to compare different algorithms based on machine learning in automated scoring (support vector machines, logistic regression, multinominal Naive Bayes, long-short term memory, and bidirectional long-short term memory). The other objective was to investigate the change in the reliability of automated scoring by differentiating the data rate used in testing the automated scoring system (33%, 20%, and 10%). While examining the reliability of automated scoring, a comparison was made with the reliability of the data obtained from human raters. In this study, which demonstrated the first automated scoring attempt of open-ended items in the Turkish language, Turkish test data of the Academic Skills Monitoring and Evaluation (ABIDE) program administered by the Ministry of National Education were used. Cross-validation was used to test the system. Regarding the coefficients of agreement to show reliability, the percentage of agreement, the quadratic-weighted Kappa, which is frequently used in automated scoring studies, and the Gwet's AC1 coefficient, which is not affected by the prevalence problem in the distribution of data into categories, were used. The results of the study showed that automated scoring algorithms could be utilized. It was found that the best algorithm to be used in automated scoring is bidirectional long-short term memory. Long-short term memory and multinominal Naive Bayes algorithms showed lower performance than support vector machines, logistic regression, and bidirectional long-short term memory algorithms. In automated scoring, it was determined that the coefficients of agreement at 33% test data rate were slightly lower comparing 10% and 20% test data rates, but were within the desired range.Öğe Madde Tepki Kuramı'na dayalı test eşitleme yöntemlerinin karma modeller üzerinde karşılaştırılması(Bolu Abant İzzet Baysal Üniversitesi, 2014) Uysal, İbrahim; Kilmen, SevilaySon yıllarda ikili ve çoklu şekilde puanlanan maddelerin aynı test üzerinde kullanımında artış görülmektedir. Türkiye'de ikili puanlanan maddeler üzerinde gerçekleştirilen test eşitleme çalışmalarına rastlansa da, karma modeller üzerinde yapılan herhangi bir test eşitleme çalışmasına rastlanmamıştır. Karma modeller kullanılarak gerçekleştirilen bu araştırmada, Madde Tepki Kuramı'na dayalı test eşitleme yöntemlerinden ortalama-ortalama, ortalama-standart sapma, sağlam ortalama-standart sapma, Haebara ve Stocking Lord'un çeşitli koşullar altında ortaya çıkardığı eşitleme hatalarının karşılaştırılması amaçlanmıştır. Ortak madde oranının (%10, %20 ve %30) ve yetenek dağılımının (normal, sola çarpık ve sağa çarpık) etkisinin incelendiği araştırma, 1000 kişilik örneklemde 40 maddelik test formları kullanılarak simulatif olarak gerçekleştirilmiştir. Eşitlenmemiş gruplar üzerinde ortak madde deseninin kullanıldığı araştırmada testler, ikili puanlanan maddeler için 3 parametreli model ve çoklu puanlanan maddeler için genelleştirilmiş kısmi kredi modeli kullanılarak WINGEN3 programıyla üretilmiştir. Aynı programla testi alan bireylere ait yetenek parametreleri belirlenmiş ve ortaya çıkan 9 farklı koşulun her biri için 100 replikasyon yapılmıştır. Simülatif verilere yönelik yetenek parametreleri PARSCALE 4.1 programıyla a likelihood ratio chi-square yöntemiyle kestirilmiştir. Ayrı kalibrasyon için test eşitleme, IRTEQ programı kullanılarak gerçekleştirilmiştir ve eşitleme hataları (RMSD) belirlenmiştir. Araştırmada elde edilen sonuçlar değerlendirildiğinde; genellikle en fazla hatayı sağlam ortalama-standart sapma yönteminin verdiği görülmüştür. Tüm koşullar değerlendirildiğinde en az hatayı sağa çarpık-sağa çarpık grupların eşitlendiği ve %30 ortak maddenin yer aldığı testlerde Stocking Lord yöntemi vermiştir. Bunun yanı sıra benzer yetenek dağılımlarının (normal-normal, sola çarpık-sola çarpık, sağa çarpık-sağa çarpık) farklı yetenek dağılımlarından (sola çarpık-normal, sağa çarpık-normal, sağa çarpık-sola çarpık) daha az hatalı sonuçlar verdiği bulunmuştur.Öğe Öğretmen Adaylarının Ölçme ve Değerlendirme Konusundaki Öz-Yeterlik Algılarının İncelenmesi(2013) Kürşad, Merve Şahin; Uysal, İbrahimBu çalışmanın amacı, öğretmen adaylarının ölçme ve değerlendirme konusundaki özyeterlik algılarını cinsiyet, sınıf, bölüm ve öğretim durumu değişkenleri açısından incelemektir. Tarama modelinin kullanıldığı bu çalışmada, araştırma grubunu 2012-2013 eğitim-öğretim yılı bahar döneminde Batı Karadeniz Bölgesindeli bir üniversitenin Eğitim Fakültesinde öğrenim görmekte olan 181 öğretmen adayı oluşturmaktadır. Çalışmada veri toplama aracı olarak Kılınç (2011)'ın geliştirdiği "Öğretmen Adaylarının Eğitimde Ölçme ve Değerlendirmeye Yönelik Öz Yeterlik Algı Ölçeği" kullanılmıştır. Verilerin analizinde cinsiyet, öğretim durumu (I. ve II. öğretim) ve sınıf düzeyi faktörlerinin ölçme ve değerlendirme konusundaki öz-yeterliğe etkisi bağımsız örneklemler için t testi; sınıf değişkeninin ölçme ve değerlendirme konusundaki öz-yeterliğe etkisi ise tek yönlü Anova ile incelenmiştir. Araştırma sonucunda öğretmen adaylarının ölçme ve değerlendirmeye yönelik öz-yeterlik algılarının cinsiyet, öğretim durumu, sınıf değişkeni ve bölüm değişkenlerine göre istatistiksel olarak önemli bir farklılık göstermediği sonucuna ulaşılmıştırÖğe Pedagojik formasyon programı öğrencilerinin genel öz yeterlik ve epistomolojik inançlarının karşılaştırmalı incelemesi(2013) Uysal, İbrahim; Kösemen, SelinBu çalışmanın amacı pedagojik formasyon programı öğrencilerinin genel öz yeterlikleri ile epistemolojik inançları arasındaki ilişkiyi belirlemektir. İlişkisel tarama modelinin kullanıldığı bu çalışmada araştırma grubunu Ege Bölgesi‘ndeki bir üniversitede öğrenim görmekte olan 83 ve Batı Karadeniz Bölgesi‘ndeki bir üniversitede öğrenim görmekte olan 91 pedagojik formasyon programı öğrencisi oluşturmaktadır. Çalışmada veri toplama araçları olarak Shwarzer ve Jerusalem (1995) tarafından geliştirilen ve Aypay (2010) tarafından Türk kültürüne uyarlanan Genel Öz-Yeterlik Ölçeği ile Schommer (1990) tarafından geliştirilen Deryakulu ve Büyüköztürk (2002) tarafından Türk kültürüne uyarlanan Epistemolojik İnanç Ölçeği kullanılmıştır. Nicel veriler 2012-2013 öğretim yılı bahar döneminde toplanmıştır. Verilerin analizinde Bağımsız Örneklemler için t-Testi, Mann-Whitney U-Test ve Spearman korelasyon katsayısı tekniği kullanılmıştır. Araştırma sonucunda pedagojik formasyon programı öğrencilerinin genel özyeterlikleri ile epistemolojik inancın alt boyutu olan “öğrenmenin çabaya bağlı olduğuna yönelik inanç” arasında, negatif yönde ve orta düzeyde bir ilişki bulunmuştur. Pedagojik formasyon programı öğrencilerinin genel öz-yeterlikleri, üniversiteye göre anlamlı bir Şekilde farklılık gösterirken cinsiyete göre anlamlı bir farklılık bulunmamıştır. Epistemolojik inancın alt boyutlarından “öğrenmenin çabaya bağlı olduğuna yönelik inanç” cinsiyete göre anlamlı bir farklılık göstermezken üniversiteye göre anlamlı bir Şekilde değişmiştir. Epistemolojik inancın diğer alt boyutu olarak belirlenen “öğrenmenin yeteneğe bağlı olduğuna yönelik inanç ve tek bir doğrunun var olduğuna yönelik inanç” ise cinsiyete göre anlamlı bir farklılık gösterirken üniversiteye göre anlamlı bir şekilde değişmemiştir.Öğe Performances based on ability estimation of the methods of detecting differential item functioning: A Simulation study(2019) Uysal, İbrahim; Ertuna, Levent; Ertaş, F. Güneş; Kelecioğlu, HülyaThe aim of the study is to examine differential item functioning (DIF) detection methods—the simultaneous item bias test (SIBTEST), Item Response Theory likelihood ratio (IRT-LR), Lord chi square (?2), and Raju area measures—based on ability estimates when purifying items with DIF from the test, considering conditions of ratio of the items with DIF, effect size of DIF, and type of DIF. This study is a simulation study and 50 replications were conducted for each condition. In order to compare DIF detection methods, error (RMSD) and coefficient of concordance (Pearson’s correlation coefficient) were calculated according to estimated and initial abilities for the reference group. As a result of the study, the lowest error and the highest concordance were seen in the case of 10% uniform DIF in the test and the method of IRT-LR, considering all other conditions. Moreover, for the method of SIBTEST and IRT-LR in all conditions, it was found that the error obtained by purifying items with C level DIF is lower than the error obtained by purifying items with both B and C level DIF. Similarly, for the method of SIBTEST and IRT-LR in all conditions, it was seen that the concordance coefficient found by purifying C level DIF is higher than the coefficient by purifying items with both B and C level DIFÖğe A phenomenological study on the effectiveness of curriculum and course ınformation packages in the bologna process(2019) Fer, Seval; Kelecioğlu, Hülya; Aktan, Derya Çobanoğlu; Burakgazi, Sevinç; Özdiyar, Özlenen; Uysal, Funda; Uysal, Nermin Kıbrıslıoğlu; Uysal, İbrahimThe aim of the present study is to analyze the School of Education and Department of Educational Sciences’ curricula (program and course information packages) with respect to the Bologna process. Designed in line with phenomenology, the study focuses on the phenomenon of “the effectiveness of curricula with respect to the Bologna process”. The data were collected by interviewing two separate focus groups of students and lecturers, and analyzed by using Miles and Huberman’s stages. The results of the three research questions are explained in terms of preparation, implementation, follow-up and revision, and quality assurance. The results are as follows: Considering the positive aspects of the process, both lecturers and students agreed that the process eased access to information and course selection with the help of elective courses; however, all participants complained about the lack of information flow, unclear tasks and process, disbelief in the importance of the process, resistance to the preparation process, unfair work distribution, and the mismatch between competencies and courses. Lecturers also mentioned problems related to the revision and feedback processes.Öğe The Pre-Diagnostic Test of Visual Perception for the Field of Painting Talent in Science and Art Centers: A Pilot Study(T.C. Milli Egitim Bakanligi, 2023) Göksu, Seçkin; Eren, Altay; Erİştİ, Suzan Duygu; Uysal, İbrahim; Erol, Songül EsinThe aim of the study was to develop a ‘visual perception test’ that would be utilized in the pre-diagnosis phase of the field of painting talent in science and art centers. A total of 806 elementary school students constituted the sample of the study. An exploratory correlation design was adopted in the study. A series of Bayesian confirmatory factor analyses were conducted to examine the structural validity of the test. Internal consistency of the test was investigated by computing the coefficients of McDonald’s omega, standardized alpha, and the greatest lower bound for each factor and for the test as a whole. Additionally, it was also examined whether the test items showed differential item functioning across gender and date of birth of the students along with the response times on the test items. The findings showed that the test could be described by both the first-order factors entitled visual recall, visual association, visual distinctiveness, visual-spatial perception, and visual pattern recognition, and a second-order factor entitled ‘visual perception’. The findings further demonstrated that the test items showed no significant differential item functioning across gender and date of birth, and that letting the students 60 seconds for each item was unnecessary. © 2023, Milli Egitim. All Rights Reserved.Öğe Scale development study about the effect of cartoon movies on affective characteristics of elementary school students(Ankara University, 2018) Uysal, İbrahim; Sarıça, SelinSince the effects of cartoon movies on the affective characteristics of elementary school students have not been investigated very much in the past, no measuring tool has been found to be valid and reliable. The aim of the research in this direction is to develop a scale aimed at determining the effects of cartoon movies on the affective characteristics of elementary school students. The study group of the research is composed of 234 elementary school students in Afyonkarahisar city and 220 elementary school students in Aydın city. Item scale correlations were calculated for item validity in the scale development phase and item analyzes based on the comparison of the average of the lower and the upper group were performed. An exploratory factor analysis and confirmatory factor analysis were performed to determine the construct validity of the scale. In the exploratory factor analysis, the principal component analysis was chosen as the factor subtraction method. As there is a relation between dimensions in the scale which was observed to be a two-dimensional structure, PROMIN has been chosen as oblique rotation method. Robust weighted least squares method was used as the confirmatory factor analysis method. Cronbach Alpha, Stratified Alpha, Armor Teta and McDonald Omega coefficients were calculated to provide evidence of scale reliability. As a result of the research, it is concluded that the data obtained from three-graded scale which consists of 12 items are valid and reliable. Accordingly, it can be expected that the measurement tool will provide valid and reliable results for similar samples. © 2019, Ankara University. All rights reserved.Öğe To what extent are item discrimination values realistic? A new index for two-dimensional structures(Ijate-Int Journal Assessment Tools Education, 2022) Kılıç, Abdullah Faruk; Uysal, İbrahimMost researchers investigate the corrected item-total correlation of items when analyzing item discrimination in multi-dimensional structures under the Classical Test Theory, which might lead to underestimating item discrimination, thereby removing items from the test. Researchers might investigate the corrected item-total correlation with the factors to which that item belongs; however, getting a general overview of the entire test is impossible. Based on this problem, this study aims to recommend a new index to investigate item discrimination in two-dimensional structures through a Monte Carlo simulation. The new item discrimination index is evaluated by identifying sample size, item discrimination value, inter-factor correlation, and the number of categories. Based upon the results of the study it can be claimed that the proposed item discrimination index proves acceptable performance for two-dimensional structures. Accordingly, using this new item discrimination index could be recommended to researchers when investigating item discrimination in two-dimensional structures.