Seminars

CMT Weekly Seminars

Spring 2017 Seminars

The Georgia Tech Center for Music Technology Spring Seminar Series features both invited speakers and second-year student project proposal presentations. The seminars take place on Mondays from 2:05-2:55 p.m. in the Van Leer Building (Room C457) on Georgia Tech's campus and are open to the public. Below is the schedule of invited speakers and student presentations for spring 2017:

January 9: Mike Winters, Center for Music Technology Ph.D. student

View Abstract

Working with human participants is an important part of evaluating your work. However, it is not always easy to know what is ethical and what is not, as several factors must be considered. In this talk, I will discuss ethical issues of using human participants in research, from the Belmont Report to submitting an IRB protocol. I will also consider the ethical issues in the projects I have worked on in the past year, including a system for Image Accessibility.

January 23: Mark Riedl, an associate professor in the School of Interactive Computing and director of the Entertainment Intelligence Lab

View Abstract

Computational creativity is the art, science, philosophy, and engineering of computational systems that exhibit behaviors that unbiased observers would deem to be creative. We have recently seen growth in the use of machine learning to generate visual art and music. In this talk, I will overview my research on generating playable computer games. Unlike art and music, games are dynamical systems in which the user chooses how to engage with the content in a virtual world, posing new challenges and opportunities. The presentation will cover machine learning for game level generation and story generation as well as broader questions of defining creativity.

January 30: Lisa Margulis, professor and director of the Music Cognition Lab at the University of Arkansas

View Abstract

This talk introduces a number of behavioral methodologies for understanding the kinds of experiences people have while listening to music. It explores the ways these methodologies can illuminate experiences that are otherwise difficult to talk about. Finally, it assesses the potential and the limitations of using science to understand complex cultural phenomena.

February 6: Martin Norgaard, assistant professor of music education at Georgia State University

View Abstract

In our recent pilot study, middle school concert band students who received instruction in musical improvisation showed far-transfer enhancements in some areas of executive function related to inhibitory control and cognitive flexibility compared to other students in the same ensemble. Why does improvisation training enhance executive function over and above standard music experience? Music improvisation involves the ability to adapt and integrate sounds and motor movements in real-time, concatenating previously stored motor sequences in order to flexibly produce the desired result, in this case, a particular auditory experience. The output of improvisation must then be evaluated by the musician in real time based on internal goals and the external environment, which may lead to the improviser modifying subsequent motor acts. I explore how developing these processes could cause the observed far-transfer effects by reviewing our previous qualitative and quantitative research as well as significant theoretical frameworks related to musical improvisation.

February 13: Chris Howe, Project Engineer at Moog Music

View Abstract

Chris Howe is a project engineer at Moog Music where he helps create new musical tools to inspire creativity. He will be discussing his role as embedded systems designer on the Global Modular project, a collaboration with artist Yuri Suzuki which explores globalization through crowd-sourced sampling, convolution reverb, and spectral morphing.

February 20: Michael Casey, Professor of Music and Computer Science at Dartmouth

View Abstract

Our goal is to build brain-computer interfaces that can capture the sound in the mind's ear and render it for others to hear. While this type of mind reading sounds like science fiction, recent work by computer scientists and neuroscientists (Nishimoto et al., 2011; Haxby et al., 2014) has shown that visual features corresponding to subjects' perception of images and movies can be predicted from brain imaging data alone (fMRI). We present our research on learning stimulus encoding models of music audio from human brain imaging, for both perception and imagination of the stimuli (Casey et al., 2012; Hanke et al., 2015; Casey 2017). To encourage further development of such neural decoding methods, the code, stimuli, and high-resolution 7T fMRI data from one of our experiments have been publicly released via the OpenfMRI initiative.

Prof. Casey and Neukom Fellow Dr. Gus Xia will also discuss the Neukom Institute's 2017 Turing Test in Human-Computer Music Interaction, comprising several performance tasks in instrumental music and dance. Competitors are asked to create artificial performers capable of performing “duets” with human performers, possibly in real time.

Gus Xia, Neukom Postdoctoral Fellow at Dartmouth

View Abstract

Expressive Human-Computer Music Interaction: In this talk, Gus will present various techniques for endowing automatic accompaniment systems with musical expression, including nuanced timing and dynamics deviations, humanoid robotic facial and gestural expression, and basic improvisation techniques. He will also promote the 2017 "Turing Test for Creative Art," which was initiated at Dartmouth College and this year includes a new track on human-computer music performance. For more information, please visit http://bregman.dartmouth.edu/turingtests/.

February 27: Roxanne Moore, Research Engineer II at Georgia Tech

View Abstract

There are a lot of ideas out there about how to "fix" education in the United States, particularly in K-12. However, new innovations are constantly met with the age-old question: Does it work? Different stakeholders have different definitions of what it means to "work" and each of those definitions has unique measurement and assessment challenges. In this talk, we'll look at different ways of answering the "Does it work?" question in the context of different education innovations that I've personally worked on. We'll look at the innovations themselves and the research methods used to assess whether or not those innovations "work." We'll also take a complex systems view of schools, including some systems dynamics models of school settings, to better understand the challenges and opportunities in K-12 education.

March 6: Klimchak, Artist

View Abstract

Klimchak will discuss his musical compositional methods, which involve the intersection of home-built instruments, low- and high-tech sound manipulation, and live performance. He will perform two pieces: WaterWorks (2004) for a large bowl of amplified water, and Sticks and Tones (2016) for frame drum, melodica, and laptop.

March 13: Annie Zhan, Software Engineer at Pandora

View Abstract

Music technology is playing an increasingly important role in academic and industrial research and development. At Pandora, we conduct extensive research on intelligent systems, machine listening, and recommendation systems. How is music information retrieval used in industry? What are the key successes and challenges? This talk will cover several of my graduate research projects in MIR (music mood detection, the Shimi band) as well as the audio fingerprinting duplicate-detection system and music recommendation systems developed at Pandora.

March 27: Avrosh Kumar, Nikhil Bhanu

View Abstracts

Avrosh's Abstract: The focus of this project is to develop a DAW (digital audio workstation) interface to aid audio mixing in virtual reality. The application loads an Ableton Live session and creates a representation of it in virtual reality, taking advantage of depth and a wider field of view. This gives audio engineers a way to look at the mix, visualize panning and frequency spectra from a new perspective, and interact with the DAW controls using gestures.
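As a rough illustration of the kind of track-to-space mapping such an interface might use (the audio features, coordinate ranges, and function name below are assumptions for the sketch, not the project's actual implementation), each track could be positioned in the virtual scene from its pan setting and spectral content:

    # Illustrative sketch: map a track's pan and spectral brightness to a
    # position in a 3D mixing scene. Feature choices and ranges are assumptions.
    import numpy as np
    import librosa

    def track_position(audio_path, pan, sr=44100):
        """Return an (x, y, z) position for one track; pan is in [-1, 1]."""
        samples, sr = librosa.load(audio_path, sr=sr, mono=True)
        centroid = float(np.mean(librosa.feature.spectral_centroid(y=samples, sr=sr)))
        rms = float(np.mean(librosa.feature.rms(y=samples)))
        x = pan                                  # left/right follows the pan position
        y = np.clip(centroid / (sr / 2), 0, 1)   # brighter tracks sit higher
        z = 1.0 - np.clip(rms * 10, 0, 1)        # quieter tracks recede into the scene
        return (x, float(y), float(z))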

Nikhil's Abstract: Astral Plane is an object-based spatial audio system for live performances and improvisation. It employs Higher-Order Ambisonics and is built in Max/MSP with Ableton Live users in mind. The core idea is to create and apply metadata to sound objects (audio tracks in Live) in real time, at signal rate, including object origin, position, trajectory, speed of motion, and mappings. The novel features include interactive touch and gesture control via an iPad interface, continuous and one-shot geometric trajectories and patterns, sync with Live via Ableton Link, and automatic spatialization driven by audio features extracted in real time. The motivations are to explore the capability of stationary and moving sounds in 2D space and to assess the perceptibility and musicality of various trajectories and interaction paradigms. The aim is to enable artists and DJs to engage in soundscape composition, tension building and release, and storytelling. Project source and additional information are available on GitHub.
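For context on the spatialization technique named above, here is a generic first-order Ambisonics encoder, not Astral Plane's own Max/MSP implementation; the angle convention and FuMa-style W scaling are assumptions:

    # Illustrative first-order Ambisonics (B-format) encoder for one mono sound object.
    import numpy as np

    def encode_first_order(mono, azimuth_deg, elevation_deg=0.0):
        """Encode a mono signal into W, X, Y, Z channels."""
        az = np.deg2rad(azimuth_deg)
        el = np.deg2rad(elevation_deg)
        w = mono * (1.0 / np.sqrt(2.0))      # omnidirectional component (FuMa scaling)
        x = mono * np.cos(az) * np.cos(el)   # front/back
        y = mono * np.sin(az) * np.cos(el)   # left/right
        z = mono * np.sin(el)                # up/down
        return np.stack([w, x, y, z])

A moving object would simply update azimuth and elevation per audio block, which is what real-time trajectory metadata drives.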

April 3: Shi Cheng, Hua Xiao

April 10: Milap Rane, Sirish Satyavolu

April 17: Jonathan Wang, Shijie Wang

View Abstracts

Jonathan's Abstract: This project in vocal acoustics focuses on vocal health and the effects of vocal disorders on the acoustic output of the human voice. For many professionals, growths on the vocal folds alter their oscillatory motion and ultimately affect the sound of their voice as well as their health. However, most people with voice disorders do not seek medical attention or treatment. My project aims to create a preliminary diagnosis tool by comparing a recording of a patient’s voice with other voice recordings.
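One hedged sketch of how such a comparison could work (the features, scoring, and file-handling details here are illustrative assumptions, not the project's actual diagnostic method) is to summarize each recording with spectral features and measure how far the patient's voice sits from a reference set:

    # Illustrative sketch: compare a voice recording against reference recordings
    # using MFCC statistics. Features and scoring are assumptions, not the
    # project's actual diagnostic criteria.
    import numpy as np
    import librosa

    def voice_features(path, sr=16000):
        samples, sr = librosa.load(path, sr=sr, mono=True)
        mfcc = librosa.feature.mfcc(y=samples, sr=sr, n_mfcc=13)
        # Summarize each coefficient by its mean and variability over time.
        return np.concatenate([mfcc.mean(axis=1), mfcc.std(axis=1)])

    def distance_to_references(patient_path, reference_paths):
        refs = np.stack([voice_features(p) for p in reference_paths])
        mu, sigma = refs.mean(axis=0), refs.std(axis=0) + 1e-8
        patient = voice_features(patient_path)
        # Mean absolute z-score: larger values suggest the voice is atypical
        # relative to the reference set and may warrant clinical follow-up.
        return float(np.mean(np.abs((patient - mu) / sigma)))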

April 24: Brandon Westergaard, Amruta Vidwans

View Abstracts

Amruta’s Abstract: Dereverberation is an important pre-processing step in audio signal processing and a critical one for speech recognition and music information retrieval (MIR) tasks. It has been well researched for speech signals, but those methods cannot be directly applied to music. In the previous semester, existing speech-based dereverberation algorithms were evaluated on music signals; this semester, the focus is on using machine learning to perform music dereverberation. This project will be useful for MIR tasks and for audio engineers who want to obtain dry recordings.
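As a hedged sketch of the data preparation a machine-learning approach like this typically needs (an assumption for illustration, not the project's actual pipeline), paired reverberant/dry training examples can be synthesized by convolving dry music with room impulse responses:

    # Illustrative sketch: build (reverberant, dry) training pairs by convolving
    # dry music with a room impulse response. File paths are placeholders.
    import numpy as np
    import soundfile as sf
    from scipy.signal import fftconvolve

    def make_pair(dry_path, rir_path):
        dry, sr = sf.read(dry_path)
        rir, sr_rir = sf.read(rir_path)
        assert sr == sr_rir, "resample the RIR to match the music sample rate"
        if dry.ndim > 1:
            dry = dry.mean(axis=1)               # work on a mono mixdown
        if rir.ndim > 1:
            rir = rir.mean(axis=1)
        wet = fftconvolve(dry, rir)[: len(dry)]  # simulate the room
        wet /= np.max(np.abs(wet)) + 1e-9        # normalize to avoid clipping
        return wet, dry                          # model input, training target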

May 1: Tyler White, Lea Ikkache

View Abstracts

Tyler's Abstract: My project is the Robotic Drumming Third Arm. The main goal is to explore how a shared control paradigm between a human drummer and wearable robotics can influence and potentially enhance a drummer’s performance and capabilities. A wearable system lets us examine interaction beyond the visual and auditory interaction explored in non-wearable robotic systems such as Shimon or systems that attach actuators directly to the drums. My contributions to this project include a sensor fusion system, data filtering and smoothing methods, a custom PCB that I designed and fabricated, custom firmware and hardware for communication from the Arduino to Max/MSP, advanced stabilization techniques for two moving bodies, and high-level musical interactivity programs for performance. Watch a short video of the project here.
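As a minimal sketch of one ingredient mentioned above, sensor fusion for the arm's orientation, here is a generic complementary filter; the sensor layout, axes, sample period, and gain are assumptions, not the project's actual firmware:

    # Illustrative complementary filter: fuse gyroscope rate and accelerometer
    # tilt into a single pitch-angle estimate. Gains and axes are assumptions.
    import math

    def complementary_filter(samples, dt=0.005, alpha=0.98):
        """samples: iterable of (gyro_pitch_rate_dps, accel_y_g, accel_z_g)."""
        pitch = 0.0
        estimates = []
        for gyro_rate, ay, az in samples:
            accel_pitch = math.degrees(math.atan2(ay, az))  # gravity-based tilt
            # Trust the gyro for fast motion and the accelerometer for drift correction.
            pitch = alpha * (pitch + gyro_rate * dt) + (1.0 - alpha) * accel_pitch
            estimates.append(pitch)
        return estimates

In a real setup, the smoothed estimate would then be streamed to Max/MSP, for example over serial or OSC.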

Lea's Abstract: This project revolves around a sound exhibition called Memory Palace. The application uses indoor localization to create a 3D sound "library" in the exhibition space. Using their smartphones, visitors can record sounds (music or words) and place them in space; when a phone later hovers near a sound someone has placed, that sound plays back. The application, which is based on Web Audio and whose development started at IRCAM, aims to make users reflect on memory and play with sounds and space.
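A minimal sketch of the proximity-triggered playback idea (the coordinates, radius, and data structure are assumptions for illustration; the actual application is built on Web Audio):

    # Illustrative sketch: trigger a placed sound when the listener's estimated
    # indoor position comes within a radius of it. Values are placeholders.
    import math

    PLACED_SOUNDS = [
        {"name": "memory_01.wav", "pos": (2.0, 1.5, 1.2)},   # x, y, z in meters
        {"name": "memory_02.wav", "pos": (5.5, 0.8, 1.0)},
    ]

    def sounds_in_range(listener_pos, radius=0.75):
        """Return the placed sounds whose anchor lies within `radius` meters."""
        hits = []
        for sound in PLACED_SOUNDS:
            d = math.dist(listener_pos, sound["pos"])
            if d <= radius:
                hits.append((sound["name"], d))
        return hits

    # The phone's indoor-localization estimate drives playback:
    print(sounds_in_range((2.3, 1.4, 1.2)))   # -> [('memory_01.wav', 0.316...)]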