Overview

I am a Professor of Statistical Science at Duke University and a faculty member of Duke Institute of Brain Sciences. Statistics is an academic discipline that is founded on not one but two philosophical cornerstones: the Bayesian and the Frequentist principles of quantifying uncertainty. Overshadowing the complementary qualities of these two principles, their conflicting aspects have often caused divisions in the practice of statistics for scientific use. By innovating new statistical methods rooted in Bayesian inference and examining their Frequentist behavior, I have made a modest attempt at straddling the fault lines of these divisions.

Brief Bio. I received my statistics education from Indian Statistical Institute, Kolkata (BStat 2000, MStat 2002) and completed my doctoral research at Purdue University (PhD 2006) under the supervision of JK Ghosh. My doctoral thesis won the Leonard J Savage Award (Theory) from ISBA. I spent the next three years at Carnegie Mellon University as the Morris H DeGroot Visiting Assistant Professor, where my statistics thinking and research interests were deeply influenced by Rob Kass and Jay Kadane. I joined Duke University in 2009 as an Assistant Professor and was promoted to Associate Professor in 2016, and to Professor in 2022. I received the Young Statistician Award from IISA in 2016. I have been a member of ISBA, ASA, IMS and IISA and over the years I have served in various elected roles in these academic societies. Click here to download a CV as a PDF file.

Select Publication. Below is a partial list of papers I have authored or coauthored, with a selection bias toward more recent work. A complete list is available at my Google Scholar profile.

Tokdar, ST, Jiang, S and Cunnlingham, EL (2022). Heavy-Tailed Density Estimation. Journal of the American Statistical Association (Theory & Methods), in press.
Chen, X and Tokdar, ST (2021). Joint quantile regression for spatial data. Journal of the Royal Statistical Society, Series B (Theory & Methods), 83(4), 826-852.
Jiang, S and Tokdar, ST (2021) Variable selection consistency of Gaussian process regression. The Annals of Statistics 49(5) 2491-2505.
Glynn, C, Tokdar, ST, Zaman, A, … and Groh, JM (2021). Analyzing second order stochasticity of neural spiking under stimuli-bundle exposure. The Annals of Applied Statistics, 15(1), 41-63.
Glynn, C, Tokdar, ST, Howard, B and Banks, DL (2019). Bayesian Analysis of Dynamic Linear Topic Models. Bayesian Analysis, 14(1) 53-80.
Caruso, VC, Mohl, JT, Glynn, C, …, Tokdar, ST and Groh, JM (2018). Single neurons may encode simultaneous stimuli by switching between activity patterns. Nature Communication 9, 2715.
Yang, Y and Tokdar, ST (2017). Joint estimation of quantile planes over arbitrary predictor spaces. Journal of the American Statistical Association, 112(519), 1107–1120.
Yang, Y and Tokdar, ST (2015). Minimax-Optimal Nonparametric Regression in High Dimensions. The Annals of Statistics, 43(2), 652-674.
Shen, W, Tokdar, ST and Ghosal, S (2013). Adaptive Bayesian Multivariate Density Estimation with Dirichlet Mixtures. Biometrika, 100(3), 623-640.

Statistics Research

I work on Nonparametric Bayes which tangles with the seemingly impossible task of extracting information out of limited data on infinitely many unknown quantities. Although such tasks are made feasible by supposing the unknown quantities arrange themselves into neat geometric shapes such as curves or surfaces, they still prove tricky to both subjective and objective Bayesian viewpoints on the question of prior allocation. The mathematics of objective prior allocation simply breaks down when faced with infinite dimensional geometry, while contemplating or communicating subjective considerations on an infinite number of items quickly overwhelms the human mind. But useful solutions exist in the middle grounds of the subjective–objective divide in the form of intersubjective Bayesian considerations aided by frequentist calculations.

Bayesian Smoothing and Posterior Consistency. Although data smoothing has existed for over three hundred years and formal statistical treatments have existed since at least 1960s, Nonparametric Bayes has made a fundamental contribution to the methodology by resolving a theoretical bottleneck: how to adjust the degree of smoothing so that information from any single data point is extracted away to an adequately sized neighboring space but not to regions at great distance. Bayesian solutions to this just-right smoothing problem have evolved on the theoretical foundation of posterior consistency and a more in-depth variation of it known as optimal posterior contraction, a mathematical construct for evaluating the asymptotic concentration rate of the posterior distribution against benchmarks set by information theoretic limits on statistical learning rates. My research has played a role in establishing the narrative that posterior consistency and optimal posterior contraction could be guaranteed by at least two distinct strategies for prior allocation on smooth function spaces: either by using an infinite Dirichlet process mixture of smooth kernels or by using a smooth Gaussian process.

Jiang, S and Tokdar, ST (2021) Variable selection consistency of Gaussian process regression. The Annals of Statistics 49(5) 2491-2505.
Yang, Y and Tokdar, ST (2015). Minimax-Optimal Nonparametric Regression in High Dimensions. The Annals of Statistics, 43(2), 652-674.
Shen, W, Tokdar, ST and Ghosal, S (2013). Adaptive Bayesian Multivariate Density Estimation with Dirichlet Mixtures. Biometrika, 100(3), 623-640
Pati, D, Dunson, DB and Tokdar, ST (2012). Posterior Consistency in Conditional Distribution Estimation. Journal of Multivariate Analysis, 116, 456-472.
Martin, RG and Tokdar, ST (2012). A Nonparametric Empirical Bayes Framework for Large-scale Significance Testing. Biostatistics, 13(3), 427-439.
Tokdar, ST and Ghosh, JK (2007). Posterior consistency of logistic Gaussian process priors in density estimation. Journal of Statistical Planning and Inference, 137(1), 34-42.
Tokdar, ST (2006). Posterior consistency of Dirichlet location-scale mixture of normals in density estimation and regression. Sankhyā, 67(4), 90-110.

Quantile Regression. Linear quantile models allow scientists to analyze how predictor influence varies across response quantiles. Such analyses, often of important scientific implication in economic and environmental sciences, require combining separate quantile regression fits from every quantile level of interest, an act of aggregation that is not founded on a coherent probabilistic model. This theoretical gap leads to legitimacy issues such as quantile crossing and quantile cherrypicking, and statistical concerns such as poor standard error estimation and limited model flexibility. My work on quantile regression has offered a comprehensive solution to this problem enabling statistical inference, prediction, model enhancement and model selection [3,4,5]. A major breakthrough of this work has been a loss–less reparametrization of a stack of non-crossing (quantile) hyperplanes in terms of unconstrained smooth functions which are directly amenable to regularized likelihood based statistical estimation [4]. The new joint estimation framework has opened doors to many important advancements of the quantile regression analysis technique to address additional data complications, e.g, censoring [2], spatiotemporal or longitudinal noise correlation [1], hierarchical structures and so on.

Chen, X and Tokdar, ST (2021). Joint quantile regression for spatial data. Journal of the Royal Statistical Society, Series B (Theory & Methods), 83(4), 826-852.
Cunningham, E, Tokdar, ST, and Clark, JS (2020). A vignette on model-based quantile regression: analysing excess zero response. In Flexible Bayesian Regression Modelling (Eds. Fan, Y, Nott, D, Smith, M and Dortet-Bernadet, J.-L.), pp 27–64.
Tokdar, ST and Cunningham, E (2019). qrjoint: Joint Estimation in Linear Quantile Regression. The Comprehensive R Archive Network.
Yang, Y and Tokdar, ST (2017). Joint estimation of quantile planes over arbitrary predictor spaces. Journal of the American Statistical Association, 112(519), 1107–1120.
Tokdar, ST and Kadane, JB (2012). Simultaneous Linear Quantile Regression: A Semiparametric Bayesian Approach. Bayesian Analysis, 7(1), 51-72.

Semiparametric Density Estimation. Density estimation is a classic smoothing exercise that is mostly considered a visualization tool. But it can substantially improve data analysis when appropriately incorporated within a hierarchical model. In [1] we argue that one can gain better accuracy and reliability in estimating the tail index of a heavy tailed distribution by fitting a suitable semiparametric density model to the entire data histogram, rather than fitting a parametric model only to thresholded data as is commonly done. [2] shows that very accurate sufficient dimension reduction, along with dimensionality selection, can be performed within a semiparametric conditional density estimation framework. [3-4] establish that substantial gains are made in power and false positive rate control in large-scale significance testing when the non-null density of the test statistic is estimated from the data.

Tokdar, ST, Jiang, S and Cunnlingham, EL (2022). Heavy-Tailed Density Estimation. Journal of the American Statistical Association (Theory & Methods), in press.
Tokdar, ST, Zhu, Y.M and Ghosh, JK (2010). Bayesian Density Regression with Logistic Gaussian Process and Subspace Projection. Bayesian Analysis, 5(2), 316-344. Codes available here.
Martin, RG and Tokdar, ST (2012). A Nonparametric Empirical Bayes Framework for Large-scale Significance Testing. Biostatistics, 13(3), 427-439.
Martin, R and Tokdar, ST (2011). Semiparametric inference in mixture models with predictive recursion marginal likelihood. Biometrika, 98(3), 567-582.

Neuroscience Research

Starting from the late nineteenth century, direct recordings of neuronal electric discharges and their mathematical and statistical analyses have played a pivotal role in understanding how the brain and the nervous system function in a variety of sensory and cognitive situations. However, many questions still remain unanswered. In particular, it is still a mystery how we perceive multiple objects present in a natural sensory scene. Sensory neurons are broadly tuned and are activated by any of several distinguishable items when presented in isolation. In scenes consisting of several such items, how is the sensory task load divided within a neural population so that information about each item could be retained? In collaboration with Jennifer Groh, we have been examining a radically new hypothesis that the brain might solve this problem via dynamic multiplexing, with each neuron juggling over time the representational tasks it is capable of performing.

We have recently completed Phase 1 of this research in which single cell recordings have revealed that throughout sensory hierarchies (from the auditory midbrain to primary visual cortex and a visual cortical face area), neurons dynamically alternate between encoding each stimuli present in a two-item scene, thus lending evidence to the credibility of our new hypothesis of multiplexing [1-3]. My statistics research interests have contributed intimately in this collaborative effort to design rigorous statistical analysis frameworks which could potentially falsify the hypothesis. Traditionally, statistical analyses of neuronal electric discharges, aka spike trains, proceed by aggregating across time (response window) and trials (replication). Detection of neuronal task juggling and quantifying its probabilistic nature have required new statistical methodology based on mixture models to encode heterogeneity of task selection, and rigorous inverse probability based tests of more than two competing hypotheses via Bayes factor calculation. A truly novel methodological development has been our Dynamic Admixture Point Process [1] model for an in-depth analysis of the temporal dynamics of task selection. DAPP offers the right statistical framework in answering a fundamental question: for neurons whose overall firing rate under double stimuli is an average of its single stimulus firing rates, do we see, when viewed in high temporal resolution, the neuron to truly average the signals or does it appear to fluctuate between the two tasks? This question could not be answered faithfully with existing hidden Markov model based statistical analysis methods which are good at modeling fluctuation, but cannot carry out rigorous assessment of how they stack up against the possibility of true averaging at high temporal resolution.

Our Phase 1 examination has not falsified our hypothesis of multiplexing despite carefully designed experiments and statistical analyses. But we are still far from producing strong evidence that multiplexing is a primary computing tool that the brain employs in representing multiple items in a crowded sensory scene. We are currently working on Phase 2 with array based spike train recordings from populations of neurons to ascertain the significance of neural fluctuation in solving the multi-item perception problem. A particular computing theory we are currently testing is that neurons within a homogeneous population may coordinate with one another in temporally dividing the task load. Our current statistics research is geared toward developing and testing Bayesian inferential models which can identify such organizational structures of functional coordination from array based recordings. Our approach combines several novel statistical modeling elements, such as stochastic block models that are traditionally used for networks analysis with sparse factor models which are typically used for learning the correlations of high dimensional recordings.

Glynn, C, Tokdar, ST, Zaman, A, … and Groh, JM (2021). Analyzing second order stochasticity of neural spiking under stimuli-bundle exposure. The Annals of Applied Statistics, 15(1), 41-63.
Tokdar, ST (2021). neuromplex: Neural Multiplexing Analysis. The Comprehensive R Archive Network.
Mohl, JT, Caruso, VC, Tokdar, ST and Groh, JM (2020). Sensitivity and specificity of a Bayesian single trial analysis for time varying neural signals. Neurons, Behavior, Data Analysis, and Theory, 3(1).
Caruso, VC, Mohl, JT, Glynn, C, …, Tokdar, ST and Groh, JM (2018). Single neurons may encode simultaneous stimuli by switching between activity patterns. Nature Communication 9, 2715.

Teaching

Over the years, I have taught a number of theory heavy core and elective courses at undergraduate and graduate levels (e.g., STA 250 and STA 732). My approach to statistics teaching, at least in recent years, has focused on exploring and understanding what makes statistics an academic discipline of its own. This is not a trivial question in today's world where data awareness and data science/analytic skills are far more pervasive than what could be imagined even twenty years back. Teaching statistics as a cookbook of data analysis methods was never exciting, but now it feels woefully outdated. A more elegant view of statistics as a branch of applied mathematics for taking decisions under uncertainty is more useful and reassuring. But it does not quite capture the full breadth of what it means to think like a statistician. For that, one needs to recognize that statistics is indeed founded on two very well defined principles of how to use the language of probability to quantify and communicate evidence in the face of uncertainty. It is imperative that in teaching statistics we expose students to historical references on how statistical thinking has evolved over centuries and how the conflicting and complementing aspects of the Bayesian and Frequentist principles are equally important to budding statisticians to know and appreciate and apply critically in their own work. Below is a partial list of courses I have taught recently.

STA 240: Statistics for Probability. This undergraduate level course introduces calculus based probability and builds toward it application in statistical inference. Last taught in Fall 2021.
STA 532: Theory of Inference. This master's level course examines how modern statistical thinking relies upon both Frequentist and Bayesian principles of statistical inference. Last taught in Spring 2021.
STA 790: Advanced Regression (Special Topics). This PhD elective course discusses Bayesian smoothing within a regression context with motivations drawn from causal analysis of observational data and extreme value analysis. Last taught in Spring 2021.

Software

A lot of my work involves scientific computing with Bayesian models. I mostly write codes in the R programming language, while using compiled C codes in the background for speed ups in iterative computation, especially for complex Markov chain Monte Carlo based computation. I have authored two R packages that are hosted on The Comprehensive R Archive Network (CRAN).

Tokdar, ST (2022). sbde: Semiparametric Bayesian Density Estimation. The Comprehensive R Archive Network.
Tokdar, ST (2021). neuromplex: Neural Multiplexing Analysis. The Comprehensive R Archive Network.
Tokdar, ST and Cunningham, E (2019). qrjoint: Joint Estimation in Linear Quantile Regression. The Comprehensive R Archive Network.

Additional code pieces associated with other papers are available here. However, their use will require additional effort from the user. Time permitting, I will be happy to offer some assistance with implementation or customization.

Doctoral Alumni

Silvia Montagna, 2013 -- Università degli Studi di Torino
Yun Yang, 2014 -- University of Illinois, Urbana-Champaign
Shaan Qamar, 2015 -- Google
Chris Glynn, 2016 -- Zillow Research
Michael Lindon, 2018 -- Netflix
Erika Cunningham, 2020
Sheng Jiang, 2021 -- Duke University
Xu Chen, 2021 -- Facebook

Work with me!

I am looking for a postdoctoral researcher to work with me and Professor Jennifer Groh on the neuroscience research outlined above, focusing on the statistical theory and methodology. Please find the job posting here and apply ASAP! This is one of two postdoc positions to be funded by our joint NIH award on "Information Preservation in Neural Codes". Please see here for the other position with a neuroscience lead. Duke University and the triangle area are great places to live and work!

Note: For students interested in graduate research under my supervision, please apply to the MSS or PhD programs offered by Duke Statistical Science. Graduate students are not directly recruited by faculty. Instead, they are admitted to these programs through admission process approved by the department and the university.