Abstract: Distributed statistical estimation and inference have gained significant importance due to the prevalence of data distributed across various locations in many applications. The distributed setting arises from constraints related to data size, privacy considerations, or security concerns. These scenarios are commonly encountered in diverse fields such as medicine, finance, and business.
In this talk, we discuss optimal distributed nonparametric regression in a decision theoretical framework, considering constraints on communication or privacy. By establishing minimax rates of convergence, we quantify the tradeoff between communication/privacy costs and statistical accuracy, particularly under independent communication protocols. Additionally, we develop estimation procedures that achieve rate-optimality. The results we present uncover novel and interesting phenomena, suggesting promising avenues for further exploration.
Papers: