- Detection of Exfiltration and Tunneling over DNS.
A Das, M-Y Shen, M Shashanka and J Wang
IEEE Intl. Conf. on Machine Learning and Applications, Cancun, Mexico, Dec 2017. [ Citations ]
Abstract
This paper proposes a method to detect two primary means of using the Domain Name System (DNS) for malicious purposes. We develop machine learning models to detect information exfiltration from compromised machines and the establishment of command & control (C&C) servers via tunneling. We validate our approach with experiments in which we successfully detect malware used in several recent Advanced Persistent Threat (APT) attacks. The novelty of our method lies in its robustness, simplicity, scalability, and ease of deployment in a production environment.
- User and Entity Behavior Analytics for Enterprise Security.
M Shashanka, M-Y Shen, J Wang
IEEE Intl. Conf. on Big Data, Washington DC, USA, Dec 2016. [ pdf ] [ Citations ]
Abstract
This paper presents an overview of an intelligence platform we have built to address threat hunting and incident investigation use-cases in the cyber security domain. Specifically, we focus on User and Entity Behavior Analytics (UEBA) modules that track and monitor behaviors of users, IP addresses and devices in an enterprise. Anomalous behavior is automatically detected using machine learning algorithms based on Singular Value Decomposition (SVD). Such anomalous behavior, indicative of potentially malicious activity, is surfaced to analysts with relevant contextual information for further investigation and action. We provide a detailed description of the models, algorithms and implementation underlying the module and demonstrate the functionality with empirical examples.
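As a rough illustration of the SVD-based anomaly detection described in the abstract, behavioral feature vectors can be scored by their reconstruction error against a low-rank subspace. The sketch below is a generic reconstruction-error scorer under that assumption, not the paper's implementation; the data and feature layout are hypothetical:

```python
import numpy as np

def svd_anomaly_scores(X, k):
    """Score each row of X by its reconstruction error against a rank-k
    SVD subspace; rows far from the dominant behavioral patterns score high."""
    mu = X.mean(axis=0)
    Xc = X - mu                                   # center the features
    _, _, Vt = np.linalg.svd(Xc, full_matrices=False)
    Vk = Vt[:k]                                   # top-k right singular vectors
    X_hat = Xc @ Vk.T @ Vk                        # project and reconstruct
    return np.linalg.norm(Xc - X_hat, axis=1)     # per-row anomaly score

# Toy data: 200 entities whose feature vectors lie in a 2-D subspace,
# plus one entity (row 200) behaving very differently.
rng = np.random.default_rng(0)
normal = rng.normal(size=(200, 2)) @ rng.normal(size=(2, 10))
outlier = 5 * np.ones((1, 10))
scores = svd_anomaly_scores(np.vstack([normal, outlier]), k=2)
```

Thresholding such scores (e.g., at a high percentile) yields alert candidates; the paper's modules may differ in features, scoring, and contextualization.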
- Collective Spammer Detection in Evolving Multi-Relational Social Networks.
S Fakhraei, J Foulds, M Shashanka, L Getoor
21st ACM SIGKDD Conf. on Knowledge Discovery and Data Mining (KDD), Sydney, Australia, Aug 2015. [ pdf ] [ Citations ]
Abstract
Detecting unsolicited content and the spammers who create it is a long-standing challenge that affects all of us on a daily basis. The recent growth of richly-structured social networks has provided new challenges and opportunities in the spam detection landscape. Motivated by the Tagged.com social network, we develop methods to identify spammers in evolving multi-relational social networks. We model a social network as a time-stamped multi-relational graph where vertices represent users, and edges represent different activities between them. To identify spammer accounts, our approach makes use of structural features, sequence modelling, and collective reasoning. We leverage relational sequence information using k-gram features and probabilistic modelling with a mixture of Markov models. In order to perform collective reasoning and improve the predictive power of a noisy abuse reporting system, we develop a statistical relational model using hinge-loss Markov random fields (HL-MRFs), a class of probabilistic graphical models which are highly scalable. We use GraphLab Create and Probabilistic Soft Logic (PSL) to prototype and experimentally evaluate our solutions on internet-scale data from Tagged.com. Our experiments demonstrate the effectiveness of our approach, and show that models which incorporate the multi-relational nature of the social network gain significant predictive performance over those that do not.
- Maximally Bijective Discretization for Data-Driven Modeling of Complex Systems.
S Sarkar, A Srivastav, M Shashanka
2013 American Control Conference, Washington, DC, Jun 2013. [ pdf ] [ Citations ]
Abstract
Phase-space discretization is a necessary step for the study of continuous dynamical systems using a language-theoretic approach. It is also critical for many machine learning techniques, e.g., probabilistic graphical models (Bayesian Networks, Markov models). This paper proposes a novel discretization method, Maximally Bijective Discretization, which finds a discretization of the dependent variables, given a discretization of the independent variables, such that the correspondence between input and output variables in the continuous domain is preserved in the discrete domain for the given dynamical system.
- An Integrated Infrastructure for Real-Time Building Energy Modeling and Fault Detection and Diagnostics.
B Dong, Z O'Neill, Z Li, D Luo, M Shashanka, S Ahuja, T Bailey
SimBuild 2012, Madison, WI, Aug 2012. [ pdf ] [ Citations ]
- Simplex Decompositions using SVD and PLSA.
M Shashanka, MJ Giering
Intl. Conf on Pattern Recognition Applications and Methods, Vilamoura, Portugal, Feb 2012. [ pdf ] [ Citations ]
Abstract
Probabilistic Latent Semantic Analysis (PLSA) is a popular technique to analyze non-negative data where multinomial distributions underlying every data vector are expressed as linear combinations of a set of basis distributions. These learned basis distributions that characterize the dataset lie on the standard simplex and themselves represent corners of a simplex within which all data approximations lie. In this paper, we describe a novel method to extend the PLSA decomposition where the bases are not constrained to lie on the standard simplex and thus are better able to characterize the data. The locations of PLSA basis distributions on the standard simplex depend on how the dataset is aligned with respect to the standard simplex. If the directions of maximum variance of the dataset are orthogonal to the standard simplex, then the PLSA bases will give a poor representation of the dataset. Our approach overcomes this drawback by utilizing Singular Value Decomposition (SVD) to identify the directions of maximum variance, and transforming the dataset to align these directions parallel to the standard simplex before performing PLSA. The learned PLSA features are then transformed back into the data space. The effectiveness of the proposed approach is demonstrated with experiments on synthetic data.
- Copula Functions for Learning Multimodal Densities with Non-linear Dependencies.
A Tewari, M Shashanka, M Giering
NIPS Workshop on Copulas in Machine Learning, Sierra Nevada, Spain, Dec 2011. [ pdf ] [ Poster ] [ Citations ]
- Real Time Model-Based Energy Diagnostics in Buildings.
Z O'Neill, M Shashanka, X Pang, P Bhattacharya, T Bailey, P Haves
12th Conf. on Intl. Building Perf. Sim. Assoc., Sydney, Australia, Nov 2011. [ pdf ] [ Citations ]
- A Fast Algorithm for Discrete HMM Training using Observed Transitions.
M Shashanka
IEEE Intl. Conf on Acoustics, Speech and Signal Processing, Prague, Czech Republic, May 2011. [ pdf ] [ Citations ]
Abstract
We present a new algorithm to estimate the parameters of a Hidden Markov Model (HMM), specifically the transition probability matrix of the hidden states and the emission probabilities, given an observed sequence of data. The algorithm uses the number of transitions present in the observed label sequence and computes parameters in an iterative fashion. We present experiments that demonstrate significant speed gains obtained by the current algorithm as compared to traditional algorithms such as Baum-Welch iterations.
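For context on why observed transitions speed things up: when the state labels themselves are observed in the training data, maximum-likelihood estimates of a discrete HMM's transition and emission probabilities reduce to normalized counts, with no iteration over hidden variables. The sketch below shows that fully-observed special case; the paper's iterative algorithm, which builds on such transition counts, is not reproduced here:

```python
import numpy as np

def hmm_from_observed(states, observations, n_states, n_symbols):
    """ML estimates of a discrete HMM's transition matrix A and emission
    matrix B when the state sequence itself is observed: both reduce to
    normalized counts."""
    A = np.zeros((n_states, n_states))    # transition counts
    B = np.zeros((n_states, n_symbols))   # emission counts
    for s, s_next in zip(states[:-1], states[1:]):
        A[s, s_next] += 1
    for s, o in zip(states, observations):
        B[s, o] += 1
    # Row-normalize counts into probabilities (tiny floor avoids 0/0)
    A = (A + 1e-12) / (A + 1e-12).sum(axis=1, keepdims=True)
    B = (B + 1e-12) / (B + 1e-12).sum(axis=1, keepdims=True)
    return A, B

states = [0, 0, 1, 1, 0, 1]
obs = [0, 1, 1, 0, 0, 1]
A, B = hmm_from_observed(states, obs, n_states=2, n_symbols=2)
# A[0] is [1/3, 2/3]: from state 0 we saw one 0->0 and two 0->1 transitions
```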
- A Privacy Preserving Framework for Gaussian Mixture Models.
M Shashanka
IEEE Intl. Workshop on Privacy Aspects of Data Mining, Sydney, Australia, Dec 2010. [ pdf ] [ Citations ]
Abstract
This paper presents a framework for privacy-preserving Gaussian Mixture Model computations. Specifically, we consider a scenario where a central service wants to learn the parameters of a Gaussian Mixture Model from private data distributed among multiple parties with privacy constraints. In addition, the service also has security constraints where none of the data owners are allowed to learn the values of the trained parameters. We use Secure Multiparty Computations to propose a framework that allows such computations. In addition, we show how such a central service can classify new test data from privacy-constrained third parties without exposing the learned models. The classification occurs with the added constraint that the service learns no information about either the test data or the result of the classification.
- Probabilistic Latent Component Analysis for Gearbox Vibration Source Separation.
J Isom, M Shashanka, A Tewari, A Lazarevic
Annual Conference of the PHM Society, Portland, Oregon, Oct 2010. [ pdf ] [ Citations ]
Abstract
Probabilistic Latent Component Analysis (PLCA) is applied to the problem of gearbox vibration source separation. A model for the probability distribution of gearbox vibration employs a latent variable intended to correspond to a particular vibration source; the measured vibration at a particular sensor is modeled, for each source, as the product of a marginal distribution of vibration by frequency, a marginal distribution of vibration by shaft rotation, and a sensor weight distribution. An expectation-maximization algorithm is used to approximate a maximum-likelihood parametrization for the model. In contrast to other unsupervised source separation methods, PLCA allows for separation of vibration sources when there are fewer vibration sensors than vibration sources. Once the vibration components of a healthy gearbox have been identified, the vibration characteristics of damaged gearbox elements can be determined. The efficacy of the technique is demonstrated with an application on a gearbox vibration data set.
- Topic Models for Audio Mixture Analysis.
P Smaragdis, M Shashanka, B Raj
NIPS Workshop on Applications for Topic Models: Text and Beyond, Vancouver, Canada, Dec 2009. [ pdf ] [ Citations ]
- A Sparse Non-parametric Approach for Single Channel Separation of Known Sounds.
P Smaragdis, M Shashanka, B Raj
Neural Information Processing Systems Conference (NIPS), Vancouver, Canada, Dec 2009. [ pdf ] [ Citations ]
Abstract
In this paper we present an algorithm for separating mixed sounds from a monophonic recording. Our approach makes use of training data which allows us to learn representations of the types of sounds that compose the mixture. In contrast to popular methods that attempt to extract compact generalizable models for each sound from training data, we employ the training data itself as a representation of the sources in the mixture. We show that mixtures of known sounds can be described as sparse combinations of the training data itself, and in doing so produce significantly better separation results as compared to similar systems based on compact statistical models.
- Simplex Decompositions for Real-valued Datasets.
M Shashanka.
IEEE Intl Workshop on Machine Learning and Signal Processing, Grenoble, France, Sep 2009. [ DOI ] [ pdf ] [ code ] [ Citations ]
Abstract
In this paper, we introduce the concept of Simplex Decompositions and present a new semi-nonnegative decomposition technique that works with real-valued datasets. The motivation stems from the limitations of topic models such as Probabilistic Latent Semantic Analysis (PLSA), which have found wide use in the analysis of non-negative data beyond text corpora, including images, audio spectra, and gene array data. The goal of this paper is to remove the non-negativity requirement so that these models can work on datasets with both positive and negative entries. We start by showing that PLSA is equivalent to finding a set of components that define the corners of a simplex within which all datapoints lie. We formalize this intuition by introducing the notion of simplex decompositions, of which PLSA and its extensions are specific examples, and generalize the idea to arbitrary real datasets with both positive and negative entries. We present algorithms and illustrate the method with examples.
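As background for the simplex view, the PLSA decomposition that this work generalizes can be computed with standard EM updates on a non-negative matrix. The following is a minimal generic PLSA sketch on toy counts (the paper's semi-nonnegative algorithm itself is not shown):

```python
import numpy as np

def plsa(V, n_topics, n_iter=200, seed=0):
    """Standard PLSA via EM on a non-negative matrix V (features x documents).

    Columns of W are basis distributions P(f|z) on the standard simplex
    (the simplex corners); columns of H are per-document weights P(z|d)."""
    rng = np.random.default_rng(seed)
    n_feat, n_docs = V.shape
    W = rng.random((n_feat, n_topics)); W /= W.sum(axis=0)
    H = rng.random((n_topics, n_docs)); H /= H.sum(axis=0)
    for _ in range(n_iter):
        Vr = V / (W @ H + 1e-12)          # ratio of data to reconstruction
        W_new = W * (Vr @ H.T)            # EM update for P(f|z) ...
        H_new = H * (W.T @ Vr)            # ... and for P(z|d), same posterior
        W = W_new / W_new.sum(axis=0)
        H = H_new / H_new.sum(axis=0)
    return W, H

# Toy counts generated exactly from a 2-topic model
Wtrue = np.array([[0.7, 0.1], [0.2, 0.2], [0.1, 0.7]])
Htrue = np.array([[0.9, 0.5, 0.1], [0.1, 0.5, 0.9]])
V = 1000 * Wtrue @ Htrue
W, H = plsa(V, n_topics=2)
```

The columns of the learned `W` stay on the standard simplex throughout, which is exactly the constraint the paper relaxes for real-valued data.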
- Missing Data Imputation for Spectral Audio Signals.
P Smaragdis, B Raj, M Shashanka.
IEEE Intl Workshop on Machine Learning and Signal Processing, Grenoble, France, Sep 2009. [ DOI ] [ pdf ] [ Citations ]
Abstract
With the recent attention to audio processing in the time-frequency domain, we increasingly encounter the problem of missing data. In this paper, we present an approach that allows for imputing missing values in the time-frequency domain of audio signals. The presented approach is able to deal with real-world polyphonic signals by performing imputation even in the presence of complex mixtures. We show that this approach outperforms generic imputation approaches, and we present a variety of situations that highlight its utility.
- Mining Retail Transaction Data for Targeting Customers with Headroom - A Case Study.
M Shashanka, M Giering.
Artificial Intelligence Applications and Innovations, Greece, April 2009. [ DOI ] [ pdf ] [ Citations ]
Abstract
We outline a method to model customer behavior from retail transaction data. In particular, we focus on the problem of recommending relevant products to consumers. Addressing this problem of filling holes in consumers' baskets is fundamental to the success of targeted promotion programs. Another important aspect is the identification of customers who are most likely to spend significantly and whose potential spending ability is not being fully realized. We discuss how to identify such customers with headroom and describe how relevant product categories can be recommended. The data consisted of individual transactions collected over a span of 16 months from a leading retail chain. The method is based on Singular Value Decomposition and can generate significant value for retailers.
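A minimal sketch of the SVD-based recommendation idea: fill "holes in the basket" by ranking a customer's unbought categories by their predicted spend in a low-rank reconstruction. The spend matrix below is hypothetical toy data, and the headroom identification described in the abstract is not reproduced:

```python
import numpy as np

def svd_recommend(R, k, customer, top_n=2):
    """Rank a customer's unbought categories by predicted spend in a
    rank-k SVD reconstruction of the customer-by-category matrix R."""
    U, s, Vt = np.linalg.svd(R, full_matrices=False)
    R_hat = U[:, :k] @ np.diag(s[:k]) @ Vt[:k]   # low-rank approximation
    unbought = R[customer] == 0                  # 'holes in the basket'
    ranked = np.argsort(-R_hat[customer])        # categories by predicted spend
    return [int(c) for c in ranked if unbought[c]][:top_n]

# Hypothetical customer x category spend matrix
R = np.array([[5., 3., 0., 1.],
              [4., 0., 0., 1.],
              [1., 1., 0., 5.],
              [1., 0., 0., 4.],
              [0., 1., 5., 4.]])
recs = svd_recommend(R, k=2, customer=1)  # categories customer 1 never bought
```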
- Probabilistic Factorization of Non-Negative Data with Entropic Co-occurrence Constraints.
P Smaragdis, M Shashanka, B Raj, GJ Mysore.
Intl Conf on Independent Component Analysis, Brazil, March 2009. [ DOI ] [ pdf ] [ Citations ]
Abstract
In this paper we present a probabilistic algorithm which factorizes non-negative data. We employ entropic priors to additionally satisfy that user specified pairs of factors in this model will have their cross entropy maximized or minimized. These priors allow us to construct factorization algorithms that result in maximally statistically different factors, something that generic non-negative factorization algorithms cannot explicitly guarantee. We further show how this approach can be used to discover clusters of factors which allow a richer description of data while still effectively performing a low rank analysis.
- Sparse and Shift-Invariant Feature Extraction from Non-Negative Data.
P Smaragdis, B Raj, M Shashanka.
IEEE Intl Conf on Acoustics, Speech and Signal Processing, Las Vegas, Nevada, Apr 2008. [ DOI ] [ pdf ] [ Citations ]
Abstract
In this paper we describe a technique that allows the extraction of multiple local shift-invariant features from analysis of non-negative data of arbitrary dimensionality. Our approach employs a probabilistic latent variable model with sparsity constraints. We demonstrate its utility by performing feature extraction in a variety of domains ranging from audio to images and video.
- Sparse Overcomplete Latent Variable Decomposition of Counts Data. [ Fig.1 Data ]
M Shashanka, B Raj, P Smaragdis.
Neural Information Processing Systems Conference (NIPS), Vancouver, Canada, Dec 2007. [ pdf ] [ supplement ] [ Citations ]
Abstract
An important problem in many fields is the analysis of counts data to extract meaningful latent components. Methods like Probabilistic Latent Semantic Analysis (PLSA) and Latent Dirichlet Allocation (LDA) have been proposed for this purpose. However, they are limited in the number of components they can extract and lack an explicit provision to control the expressiveness of the extracted components. In this paper, we present a learning formulation to address these limitations by employing the notion of sparsity. We start with the PLSA framework and use an entropic prior in a maximum a posteriori formulation to enforce sparsity. We show that this allows the extraction of overcomplete sets of latent components which better characterize the data. We present experimental evidence of the utility of such representations.
- Privacy-Preserving Musical Database Matching.
M Shashanka, P Smaragdis.
IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, New Paltz, New York, Oct 2007. [ DOI ] [ pdf ] [ Citations ]
Abstract
In this paper we present an illustrative process which allows privacy-preserving transactions in the context of musical databases. In particular we address the problem of matching a piece of music audio to a service database in such a way that the database provider will neither directly observe the query nor its result, thereby preserving the privacy of the inquirer. We formulate this process within the field of secure multiparty computation and show how such a transaction can be achieved once we derive secure versions of basic signal processing operations.
- Supervised and Semi-Supervised Separation of Sounds from Single-Channel Mixtures.
P Smaragdis, B Raj, M Shashanka.
Intl Conf on Independent Component Analysis, London, UK, Sep 2007. [ DOI ] [ pdf ] [ Citations ]
Abstract
In this paper we describe a methodology for model-based single channel separation of sounds. We present a sparse latent variable model that can learn sounds based on their distribution of time/frequency energy. This model can then be used to extract known types of sounds from mixtures in two scenarios: one being the case where all sound types in the mixture are known, and the other being the case where only the target or the interference models are known. The model we propose has close ties to non-negative decompositions and latent variable models commonly used for semantic analysis.
- Sparse Overcomplete Decomposition for Single Channel Speaker Separation. [ Examples ]
M Shashanka, B Raj, P Smaragdis.
IEEE Intl Conf on Acoustics, Speech and Signal Processing, Honolulu, Hawaii, April 2007. [ DOI ] [ pdf ] [ Citations ]
Abstract
We present an algorithm for separating multiple speakers from a mixed single channel recording. The algorithm is based on a model proposed by Raj and Smaragdis (2005). The idea is to extract certain characteristic spectro-temporal basis functions from training data for individual speakers and decompose the mixed signals as linear combinations of these learned bases. In other words, their model extracts a compact code of basis functions that can explain the space spanned by the spectral vectors of a speaker. In our model, we generate a sparse-distributed code where we have more basis functions than the dimensionality of the space. We propose a probabilistic framework to achieve sparsity. Experiments show that the resulting sparse code better captures the structure in data and hence leads to better separation.
- A Framework for Secure Speech Recognition.
P Smaragdis, M Shashanka.
IEEE Intl Conf on Acoustics, Speech and Signal Processing, Honolulu, Hawaii, April 2007. [ DOI ] [ pdf ]
Abstract
We present an algorithm that enables privacy-preserving speech recognition transactions between multiple parties. We assume two commonplace scenarios: one where one of two parties has private speech data to be transcribed and the other party has private models for speech recognition, and another where one party has a speech model to be trained using the private data of multiple other parties. In both of the above cases data privacy is desired by both the data and the model owners. In this paper we show how such collaborations can be performed while ensuring no private data leaks, using secure multiparty computations. In neither case will any party obtain information on the other parties' data. The protocols described herein can be used to construct rudimentary speech recognition systems and can be easily extended for arbitrary audio and speech processing.
- Bandwidth Expansion with a Polya Urn Model. [ Examples ]
B Raj, R Singh, M Shashanka, P Smaragdis.
IEEE Intl Conf on Acoustics, Speech and Signal Processing, Honolulu, Hawaii, April 2007. [ DOI ] [ pdf ] [ Citations ]
Abstract
We present a new statistical technique for the estimation of the high frequency components (4-8 kHz) of speech signals from narrow-band (0-4 kHz) signals. The magnitude spectra of broadband speech are modelled as the outcome of a Polya Urn process that represents the spectra as the histogram of the outcome of several draws from a mixture multinomial distribution over frequency indices. The multinomial distributions that compose this process are learnt from a corpus of broadband (0-8 kHz) speech. To estimate high-frequency components of narrow-band speech, its spectra are also modelled as the outcome of draws from a mixture-multinomial process that is composed of the learnt multinomials, where the counts of the indices of higher frequencies have been obscured. The obscured high-frequency components are then estimated as the expected number of draws of their indices from the mixture-multinomial. Experiments conducted on bandlimited signals derived from the WSJ corpus show that the proposed procedure is able to accurately estimate the high frequency components of these signals.
- Separating a Foreground Singer from Background Music. [ Examples ]
B Raj, P Smaragdis, M Shashanka, R Singh.
Intl Symposium on Frontiers of Research on Speech and Music (FRSM), Mysore, India, Jan 2007. [ pdf ] [ Citations ]
Abstract
In this paper we present an algorithm for separating singing voices from background music in popular songs. The algorithm is derived by modelling the magnitude spectrogram of audio signals as the outcome of draws from a discrete bi-variate random process that generates time-frequency pairs. The spectrogram of a song is assumed to have been obtained through draws from the distributions underlying the music and the vocals, respectively. The parameters of the underlying distributions are learnt from the observed spectrogram of the song. The spectrogram of the separated vocals is then derived by estimating the fraction of draws that were obtained from its distribution. In the paper we present the algorithm within a framework that allows personalization of popular songs, by separating out the vocals, processing them appropriately to one's own tastes, and remixing them. Our experiments reveal that we are effectively able to separate out the vocals in a song and personalize them to our tastes.
- A Probabilistic Latent Variable Model for Acoustic Modeling.
P Smaragdis, B Raj, M Shashanka.
Workshop on Advances in Models for Acoustic Processing, NIPS 2006. [ pdf ] [ Citations ]
Abstract
In this paper we describe a model developed for the analysis of acoustic spectra. Unlike decomposition techniques that can produce difficult-to-interpret results, this model explicitly represents spectra as distributions and extracts sets of additive and semantically useful components that facilitate a variety of applications ranging from source separation and denoising to music transcription and sound recognition. This model is probabilistic in nature and is easily extended to produce sparse codes and discover transform-invariant components which can be optimized for particular applications.
- Secure Sound Classification: Gaussian Mixture Models.*
M Shashanka, P Smaragdis.
IEEE Intl. Conf. on Acoustics, Speech and Signal Processing, Toulouse, France, May 2006. [ DOI ] [ pdf ] [ Citations ]
* Finalist in the Student Paper Contest.
Abstract
We propose secure protocols for Gaussian mixture-based sound recognition. The protocols we describe allow varying levels of security between two collaborating parties. The case we examine consists of one party (Alice) providing data and the other party (Bob) providing a recognition algorithm. We show that it is possible to have Bob apply his algorithm on Alice's data in such a way that the data and the recognition results will not be revealed to Bob, thereby guaranteeing Alice's data privacy. Likewise, we show that it is possible to organize the collaboration so that Alice cannot reverse-engineer Bob's recognition algorithm. We show how Gaussian mixtures can be implemented in a secure manner using secure computation primitives implementing simple numerical operations, and we demonstrate the process by showing how it can yield identical results to a non-secure computation while maintaining privacy.
- Latent Dirichlet Decomposition for Single Channel Speaker Separation. [ Examples ]
B Raj, M Shashanka, P Smaragdis.
IEEE Intl. Conf. on Acoustics, Speech and Signal Processing, Toulouse, France, May 2006. [ DOI ] [ pdf ] [ Citations ]
Abstract
We present an algorithm for the separation of multiple speakers from mixed single-channel recordings by latent variable decomposition of the speech spectrogram. We model each magnitude spectral vector in the short-time Fourier transform of a speech signal as the outcome of a discrete random process that generates frequency bin indices. The distribution of the process is modeled as a mixture of multinomial distributions, such that the mixture weights of the component multinomials vary from analysis window to analysis window. The component multinomials are assumed to be speaker specific and are learned from training signals for each speaker. We model the prior distribution of the mixture weights for each speaker as a Dirichlet distribution. The distributions representing magnitude spectral vectors for the mixed signal are decomposed into mixtures of the multinomials for all component speakers. The frequency distribution, i.e., the spectrum for each speaker, is reconstructed from this decomposition.
- Optimal Multi-Channel Data Allocation with Flat Broadcast per Channel.*
AA Bertossi, MC Pinotti, S Ramaprasad, R Rizzi, M Shashanka.
Intl. Parallel and Distributed Processing Symposium, Santa Fe, USA, Apr 2004. [ DOI ] [ pdf ] [ Citations ]
*Authors listed in alphabetical order.
Abstract
Broadcast is an efficient and scalable way of transmitting data to an unlimited number of clients that are listening to a channel. Cyclically broadcasting data over the channel is a basic scheduling technique, which is known as flat scheduling. When multiple channels are available, partitioning data among channels in an unbalanced way, depending on data popularities, is an allocation technique known as skewed allocation. In this paper, the problem of data broadcasting over multiple channels is considered assuming skewed data allocation to channels and flat data scheduling per channel, with the objective of minimizing the average waiting time of the clients. Several algorithms, based on dynamic programming, are presented which provide optimal solutions for N data items and K channels. Specifically, for data items with uniform lengths, an O(NK log N) time algorithm is proposed, which improves over the previously known O(N²K) time algorithm. When K ≤ 4, faster O(N) time algorithms are exhibited. Moreover, for data items with nonuniform lengths, it is shown that the problem is NP-hard when K = 2, and strongly NP-hard for arbitrary K. In the former case, a pseudo-polynomial algorithm is discussed, whose time is O(NZ) where Z is the sum of the data lengths.
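For intuition, the uniform-length case admits a simple O(N²K) dynamic program over contiguous groups of items sorted by non-increasing popularity, under the standard flat-scheduling cost model in which the expected wait for an item on a channel cycling through n items is n/2 slots. The sketch below illustrates the problem setup, not the faster O(NK log N) algorithm of the paper:

```python
import math

def min_avg_wait(p, K):
    """O(N^2 K) DP: partition items (sorted by non-increasing popularity p)
    into K contiguous channel groups, minimizing the average waiting time,
    i.e. the sum over groups of |group| * P(group) / 2 under flat
    per-channel scheduling."""
    N = len(p)
    prefix = [0.0]
    for x in p:
        prefix.append(prefix[-1] + x)
    def cost(i, j):                      # items i..j-1 on one channel
        return (j - i) * (prefix[j] - prefix[i]) / 2
    # dp[j][k] = best cost for the first j items on k channels
    dp = [[math.inf] * (K + 1) for _ in range(N + 1)]
    dp[0][0] = 0.0
    for j in range(1, N + 1):
        for k in range(1, K + 1):
            for i in range(k - 1, j):    # last channel holds items i..j-1
                dp[j][k] = min(dp[j][k], dp[i][k - 1] + cost(i, j))
    return dp[N][K]
```

For example, with popularities [0.5, 0.3, 0.1, 0.1] and K = 2, isolating the most popular item (or grouping the top two) achieves an average wait of 1.0 slot, versus 2.0 on a single channel.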
- A Characterisation of Optimal Channel Assignments for Wireless Networks Modelled as Cellular and Square Grids.
M Shashanka, A Pati, AM Shende.
Intl. Parallel and Distributed Processing Symposium, Nice, France, Apr 2003. [ DOI ] [ pdf ] [ Citations ]
Abstract
In this paper we first present a uniformity property that characterises optimal channel assignments for networks arranged as cellular or square grids. Then, we present optimal channel assignments for cellular and square grids; these assignments exhibit a high value for δ₁, the separation between channels assigned to adjacent stations. Based on empirical evidence, we conjecture that the value our assignments exhibit is an upper bound on δ₁.
- Channel Assignment for Wireless Networks Modelled as d-Dimensional Square Grids.
A Dubhashi, M Shashanka, A Pati, S Ramaprasad, AM Shende.
Intl. Workshop on Distributed Computing, Kolkata, India, Dec 2002. [ DOI ] [ pdf ] [ Citations ]
Abstract
In this paper, we study the problem of channel assignment for wireless networks modelled as d-dimensional grids. In particular, for d-dimensional square grids, we present optimal assignments that achieve a channel separation of 2 for adjacent stations where the reuse distance is 3 or 4. We also introduce the notion of a colouring schema for d-dimensional square grids, and present an algorithm that assigns colours to the vertices of the grid satisfying the schema constraints.