Uncategorized

All Your Rankings Are Belong to Us

A. Waters, D. Tinapple, and R. G. Baraniuk, "BayesRank: A Bayesian Approach to Ranked Peer Grading," ACM Conference on Learning at Scale, Vancouver, March 2015.

Abstract: Advances in online and computer supported education afford exciting opportunities to revolutionize the classroom, while also presenting a number of new challenges not faced in traditional educational settings. Foremost among these challenges is the problem of accurately and efficiently evaluating learner work as the class size grows, which is directly related to the larger goal of providing quality, timely, and actionable formative feedback. Recently there has been a surge in interest in using peer grading methods coupled with machine learning to accurately and fairly evaluate learner work while alleviating the instructor bottleneck and grading overload. Prior work in peer grading almost exclusively focuses on numerically scored grades – either real-valued or ordinal. In this work, we consider the implications of peer ranking in which learners rank a small subset of peer work from strongest to weakest, and propose new types of computational analyses that can be applied to this ranking data. We adopt a Bayesian approach to the ranked peer grading problem and develop a novel model and method for utilizing ranked peer-grading data. We additionally develop a novel procedure for adaptively identifying which work should be ranked by particular peers in order to dynamically resolve ambiguity in the data and rapidly resolve a clearer picture of learner performance. We showcase our results on both synthetic and several real-world educational datasets.

The figures below compares BayesRank to the known ground truth item ordering in a synthetic experiment using Kendall’s tau metric, which measures the general agreement between two ordered sets. Kendall’s tau metric looks at each pair in one ranking and compares the same items in the second ranking to check for consistency. D_tau = +1/-1 corresponds to perfect agreement/disagreement. The two curves correspond to BayesRank with observations generated using adaptive assignment (red) and random item assignment (blue); we plot the tau metric as a function of the size of the class N (left) and as a function of the number of items K assigned to each grader. In all cases, BayesRank random assignment achieves significantly better performance than traditional random assignment.

IEEE Signal Processing Society Technical Achievement Award

Richard Baraniuk, the Victor E. Cameron Professor of Electrical and Computer Engineering at Rice University, has been named recipient of the 2014 IEEE Signal Processing Society Technical Achievement Award for "contributions to the theory and applications of sparsity and compressive sensing."

The Technical Achievement Award honors a person who, over a period of years, has made outstanding technical contributions to theory and/or practice as demonstrated by publications, patents, or recognized impact on the field. The award will be presented at ICASSP 2015 in Brisbane, Australia, where Baraniuk is also a plenary speaker presenting his work on machine learning for open education.

IEEE James H. Mulligan Jr. Education Medal

Richard Baraniuk, the founder and director of OpenStax and Rice University's Victor E. Cameron Professor of Electrical and Computer Engineering, has been named recipient of the 2015 IEEE James H. Mulligan Jr. Education Medal.

The medal, presented annually since 1956 by the former Institute of Electrical and Electronics Engineers, is given for outstanding contributions to education in the fields of interest to the organization. Baraniuk was specifically cited for his "fundamental contributions to open educational resources for electrical engineering and beyond." The medal will be presented at the IEEE Honors Ceremony June 20 in New York City. Mulligan is a former president of IEEE.

Baraniuk founded Rice-based Connexions in 1999 to bring textbooks and other learning materials to the Internet. OpenStax grew from that effort as a way to provide high-quality, peer-reviewed, college-level textbooks to students worldwide. The books are free to download and can be printed at low cost. The OpenStax library includes textbooks for college physics, sociology, economics, anatomy and physiology, and separate biology books for majors and nonmajors. Since its launch in 2012, OpenStax books have been selected for use by more than 1,100 college courses and downloaded nearly a million times, saving students more than $30 million.

Who’s Working with Whom?

A. Waters, C. Studer, and R. G. Baraniuk, "Collaboration-Type Identification in Educational Datasets," Journal of Educational Data Mining, Vol. 6, No. 1, 2014.

Abstract: Identifying collaboration between learners in a course is an important challenge in education for two reasons: First, depending on the courses rules, collaboration can be considered a form of cheating. Second, it helps one to more accurately evaluate each learners competence. While such collaboration identification is already challenging in traditional classroom settings consisting of a small number of learners, the problem is greatly exacerbated in the context of both online courses or massively open online courses (MOOCs) where potentially thousands of learners have little or no contact with the course instructor. In this work, we propose a novel methodology for collaboration-type identification, which both identifies learners who are likely collaborating and also classifies the type of collaboration employed. Under a fully Bayesian setting, we infer the probability of learners succeeding on a series of test items solely based on graded response data. We then use this information to jointly compute the likelihood that two learners were collaborating and what collaboration model (or type) was used. We demonstrate the efficacy of the proposed methods on both synthetic and real-world educational data; for the latter, the proposed methods find strong evidence of collaboration among learners in two non-collaborative take-home exams.

Below we show a collaboration-type identification result using Bayesian model selection for a collection of homework assignments in an undergraduate signal processing class. The data consists of 38 learners answering 50 homework questions plus 14 midterm exam questions. Grey ellipses designate the assigned homework groups. Dashed green lines denote parasitic collaborations, while solid blue lines denote symbiotic collaborations. Dotted red lines denote the connections found using Wesolowsky’s method, which, in general, finds fewer ground truth connections than the our method.

Gates Foundation Backs Adaptive Textbook Project

Rice University-based nonprofit OpenStax, which has already provided free textbooks to hundreds of thousands of college students, has been chosen by the Bill & Melinda Gates Foundation to develop personalized courseware for college students as part of the foundation’s $20 million Next Generation Courseware Challenge. The initiative was announced last week at the 2014 Educause Conference in Orlando.

“The same technology that retailers and online search firms use to deliver personalized choices can be combined with cognitive science models to deliver personalized learning experiences for college students,” said OpenStax founder Richard Baraniuk, Rice’s Victor E. Cameron Professor of Engineering. “Our free textbooks are already making college more accessible for those who couldn’t otherwise afford it, and personalized learning technology will improve student learning outcomes as well.”

NIPS Workshop on "Human Propelled Machine Learning"

In typical applications of machine learning (ML), humans typically enter the process at an early stage, in determining an initial representation of the problem and in preparing the data, and at a late stage, in interpreting and making decisions based on the results. Consequently, the bulk of the ML literature deals with such situations. Much less research has been devoted to ML involving “humans-in-the-loop,” where humans play a more intrinsic role in the process, interacting with the ML system to iterate towards a solution to which both humans and machines have contributed. In these situations, the goal is to optimize some quantity that can be obtained only by evaluating human responses and judgments. Examples of this hybrid, “human-in-the-loop” ML approach include:

ML-based education, where a scheduling system acquires information about learners with the goal of selecting and recommending optimal lessons;
Adaptive testing in psychological surveys, educational assessments, and recommender systems, where the system acquires testees’ responses and selects the next item in an adaptive and automated manner;
Interactive topic modeling, where human interpretations of the topics are used to iteratively refine an estimated model;
Image classification, where human judgments can be leveraged to improve the quality and information content of image features or classifiers.

In this workshop in December 2014, we focused on the emerging new theories, algorithms, and applications of human-in-the-loop ML algorithms.

Workshop web page with speakers and their slides

More information about the NIPS conference

10 More OpenStax College Titles On Their Way

Rice University-based publisher OpenStax College today announced $9.5 million in philanthropic grants from the Laura and John Arnold Foundation (LJAF), Rice alumni John and Ann Doerr, and the William and Flora Hewlett Foundation to add 10 titles to its catalog of free, high-quality textbooks for the nation’s most-attended college courses by 2017. OpenStax College is creating free books for 25 of the most-attended college courses in the country.

OpenStax College uses philanthropic gifts to produce high-quality, peer-reviewed textbooks that are free online and low-cost in print. Its first seven books have already saved students more than $13 million. The books have been downloaded more than 650,000 times and have been adopted for use in nearly 900 courses at community colleges, four-year colleges, universities and high schools. OpenStax College has four titles in production for next year and plans to expand its library to 21 titles by 2017. The additional funding will allow the nonprofit publisher to develop textbooks for additional high-enrollment courses, including several science and mathematics courses.

“Our books are opening access to higher education for students who couldn’t otherwise afford it,” said Rice Professor Richard Baraniuk, founder and director of OpenStax College. “We’ve already saved students millions of dollars, and thanks to the generosity of our philanthropic partners, we hope to save students more than $500 million by 2020.”

From Denoising to Compressed Sensing

C. A. Metzler, A. Maleki, and R. G. Baraniuk, "From Denoising to Compressed Sensing," July 2014. arXiv version

Abstract: A denoising algorithm seeks to remove perturbations or errors from a signal. The last three decades have seen extensive research devoted to this arena, and as a result, today's denoisers are highly optimized algorithms that effectively remove large amounts of additive white Gaussian noise. A compressive sensing (CS) reconstruction algorithm seeks to recover a structured signal acquired using a small number of randomized measurements. Typical CS reconstruction algorithms can be cast as iteratively estimating a signal from a perturbed observation. This paper answers a natural question: How can one effectively employ a generic denoiser in a CS reconstruction algorithm? In response, in this paper, we develop a denoising-based approximate message passing (D-AMP) algorithm that is capable of high-performance reconstruction. We demonstrate that, for an appropriate choice of denoiser, D-AMP offers state-of-the-art CS recovery performance for natural images. We explain the exceptional performance of D-AMP by analyzing some of its theoretical features. A critical insight in our approach is the use of an appropriate Onsager correction term in the D-AMP iterations, which coerces the signal perturbation at each iteration to be very close to the white Gaussian noise that denoisers are typically designed to remove.

The figure below illustrates reconstructions of the 256x256 Barbara test image (65536 pixels) from 6554 randomized measurements. Exploiting the state-of-the-art BM3D denoising algorithm in D-AMP enables state-of-the-art CS recovery.

Adaptive Textbook Project Launched

Rice University-based nonprofit OpenStax, which has already provided free textbooks to hundreds of thousands of college students, today announced a $9 million effort supported by the Laura and John Arnold Foundation to develop free, digital textbooks capable of delivering personalized lessons to high school students.

"Using advanced machine learning algorithms and new models from cognitive science, we can improve educational outcomes in a number of ways,” said project founder Richard Baraniuk. “We can help teachers and administrators by tapping into metrics that they already collect — like which kind of homework and test questions a student tends to get correct or incorrect — as well as things that only the book would notice — like which examples a student clicks on, how long she stays on a particular illustration or which sections she goes back to reread.”

The technology will pinpoint areas where students need more assistance, and it will react by delivering specific content to reinforce concepts in those areas. The personalized books will deliver tailored lessons that allow individual students to learn at their own pace. For fast learners, lessons might be streamlined and compact; for a struggling student, lessons might include supplemental material and additional learning exercises.

Thomson Reuters’ Highly Cited Researcher for 2014

Richard Baraniuk was one of 3,215 researchers in the sciences and social sciences who authored papers that ranked among the top 1% most cited for their subject field and year of publication in Thomson Reuters’ academic citation indexing and search service, Web of Knowledge.
More info available here and here.