M. Alam, . Ashad, . Fukumizu, W. Kenji, and Y. , Influence function and robust variant of kernel canonical correlation analysis

. Alon, . Noga, . Matias, . Yossi, and M. Szegedy, The Space Complexity of Approximating the Frequency Moments, Journal of Computer and System Sciences, vol.58, issue.1, pp.137-147, 1999.
DOI : 10.1006/jcss.1997.1545

N. Aronszajn, Theory of reproducing kernels. Transactions of the, pp.337-404, 1950.

J. Audibert and O. Catoni, Robust linear least squares regression. The Annals of Statistics, pp.2766-2794, 2011.
DOI : 10.1214/11-aos918

URL : https://hal.archives-ouvertes.fr/hal-00522534

. Balasubramanian, . Krishnakumar, . Li, . Tong, and M. Yuan, On the optimality of kernel-embedding based goodnessof-fit tests, p.2017

L. Baringhaus, F. , and C. , On a new multivariate two-sample test, Journal of Multivariate Analysis, vol.88, issue.1, pp.190-206, 2004.
DOI : 10.1016/S0047-259X(03)00079-4

G. Blanchard, A. Deshmukh, . Anand, . Dogan, . Urun et al., Domain generalization by marginal transfer learning

O. Catoni, Challenging the empirical mean and empirical variance: A deviation study, Annales de l'Institut Henri Poincaré Probabilités et Statistiques, pp.1148-1185, 2012.
DOI : 10.1214/11-AIHP454

URL : https://hal.archives-ouvertes.fr/hal-00517206

O. Catoni and I. Giulini, Dimension-free PAC- Bayesian bounds for matrices, vectors, and linear least squares regression, p.2017

M. Collins and N. Duffy, Convolution kernels for natural language, NIPS, pp.625-632, 2001.

M. Cuturi, Fast global alignment kernels, ICML, pp.929-936, 2011.

M. Cuturi, . Fukumizu, . Kenji, . Vert, and . Jean-philippe, Semigroup kernels on measures, Journal of Machine Learning Research, vol.6, pp.1169-1198, 2005.

. Devroye, . Luc, . Lerasle, . Matthieu, . Lugosi et al., Sub-Gaussian mean estimators. The Annals of Statistics, pp.2695-2725, 2016.
DOI : 10.1214/16-aos1440

URL : https://hal.archives-ouvertes.fr/hal-01204519

. Fukumizu, . Kenji, . Gretton, . Arthur, . Sun et al., Kernel measures of conditional dependence, NIPS, pp.498-496, 2008.

. Fukumizu, . Kenji, . Song, . Le, and A. Gretton, Kernel Bayes' rule: Bayesian inference with positive definite kernels, Journal of Machine Learning Research, vol.14, pp.3753-3783, 2013.

T. Gärtner, . Flach, A. Peter, . Kowalczyk, . Adam et al., Multi-instance kernels, ICML, pp.179-186, 2002.

A. Gretton, . Fukumizu, . Kenji, C. Teo, . Hui et al., A kernel statistical test of independence, NIPS, pp.585-592, 2008.

A. Gretton, K. M. Borgwardt, . Rasch, J. Malte, . Schölkopf et al., A kernel two-sample test, Journal of Machine Learning Research, vol.13, pp.723-773, 2012.

M. Outlier, Robust Mean Embedding Estimation by Median-of-Means Guevara Cross product kernels for fuzzy set similarity, FUZZ-IEEE, pp.1-6, 2017.

. Györfi, . László, . Kohler, . Michael, . Krzyzak et al., A Distribution-Free Theory of Nonparametric Regression, 2002.
DOI : 10.1007/b97848

. Harchaoui, . Zaid, . Bach, . Francis, and E. Moulines, Testing for homogeneity with kernel Fisher discriminant analysis, NIPS, pp.609-616, 2007.
URL : https://hal.archives-ouvertes.fr/hal-00270806

D. Haussler, Convolution kernels on discrete structures, 1999.

M. Hein and O. Bousquet, Hilbertian metrics and positive definite kernels on probability measures, AISTATS, pp.136-143, 2005.

I. Tolstikhin, K. Bharath, . Sriperumbudur, and B. Schölkopf, Minimax estimation of maximal mean discrepancy with radial kernels, NIPS, 1930.

T. Jebara, . Kondor, . Risi, and A. Howard, Probability product kernels, Journal of Machine Learning Research, vol.5, pp.819-844, 2004.

M. R. Jerrum, G. Valiant, V. Leslie, and V. Vazirani, Random generation of combinatorial structures from a uniform distribution, Theoretical Computer Science, vol.43, issue.2-3, pp.169-188, 1986.
DOI : 10.1016/0304-3975(86)90174-X

Y. Jiao, . Vert, and . Jean-philippe, The Kendall and Mallows Kernels for Permutations, ICML (PMLR), pp.2982-2990, 2016.
DOI : 10.1109/TPAMI.2017.2719680

URL : https://hal.archives-ouvertes.fr/hal-01279273

. Jitkrittum, . Wittawat, . Xu, . Wenkai, . Szabó et al., A linear-time kernel goodnessof-fit test, NIPS, pp.261-270, 2017.

H. Kashima and T. Koyanagi, Kernels for semistructured data, ICML, pp.291-298, 2002.

. Kim, . Been, . Khanna, . Rajiv, . Koyejo et al., Examples are not enough, learn to criticize! criticism for interpretability, NIPS, pp.2280-2288, 2016.

J. Kim and C. D. Scott, Robust kernel density estimation, Journal of Machine Learning Research, vol.13, pp.2529-2565, 2012.

V. Koltchinskii, Oracle Inequalities in Empirical Risk Minimization and Sparse Recovery Problems, 2011.
DOI : 10.1007/978-3-642-22147-7

R. Kondor and H. Pan, The multiscale Laplacian graph kernel, NIPS, pp.2982-2990, 2016.

. Kusano, . Genki, . Fukumizu, . Kenji, and Y. Hiraoka, Persistence weighted Gaussian kernel for topological data analysis, ICML, 2004.

H. Law, . Chung-leon, . Sutherland, J. Dougal, . Sejdinovic et al., Bayesian approaches to distribution regression, AISTATS, 2018.

L. Cam and L. , Convergence of estimates under dimensionality restrictions, The Annals of Statistics, vol.1, pp.38-53, 1973.

M. Ledoux and M. Talagrand, Probability in Banach spaces, 1991.
DOI : 10.1007/978-3-642-20212-4

J. Lloyd, . Robert, . Duvenaud, . David, . Grosse et al., Automatic construction and natural-language description of nonparametric regression models, AAAI Conference on Artificial Intelligence, pp.1242-1250, 2014.

. Lodhi, . Huma, C. Saunders, . Shawe-taylor, . John et al., Text classification using string kernels, Journal of Machine Learning Research, vol.2, pp.419-444, 2002.

G. Lugosi, . Mendelson, and . Shahar, Risk minimization by median-of-means tournaments

G. Lugosi, . Mendelson, and . Shahar, Sub-gaussian estimators of the mean of a random vector

A. F. Martins, . Smith, A. Noah, E. P. Xing, P. M. Aguiar et al., Nonextensive information theoretic kernels on measures, The Journal of Machine Learning Research, vol.10, pp.935-975, 2009.

S. Mendelson, Learning without Concentration, Journal of the ACM, vol.62, issue.3, pp.1-2125, 2015.
DOI : 10.1007/978-1-4757-2545-2

URL : http://arxiv.org/pdf/1401.0304.pdf

S. Minsker, Geometric median and robust estimation in Banach spaces, Bernoulli, vol.21, issue.4, pp.2308-2335, 2015.
DOI : 10.3150/14-BEJ645

S. Minsker and N. Strawn, Distributed statistical estimation and rates of convergence in normal approximation

M. Mooij, M. Joris, . Peters, . Jonas, . Janzing et al., Distinguishing cause from effect using observational data: Methods and benchmarks, Journal of Machine Learning Research, vol.17, pp.1-102, 2016.

. Muandet, . Krikamol, . Fukumizu, . Kenji, . Dinuzzo et al., Learning from distributions via support measure machines, NIPS, pp.10-18, 2011.

. Muandet, . Krikamol, . Sriperumbudur, K. Bharath, . Fukumizu et al., Kernel mean shrinkage estimators, Journal of Machine Learning Research, vol.17, pp.1-41, 2016.

. Muandet, . Krikamol, . Fukumizu, . Kenji, . Sriperumbudur et al., Kernel Mean Embedding of Distributions: A Review and Beyond, Machine Learning, pp.1-141, 2017.
DOI : 10.1561/2200000060

A. Müller, Integral Probability Metrics and Their Generating Classes of Functions, Advances in Applied Probability, vol.28, issue.02, pp.429-443, 1997.
DOI : 10.2307/1427540

A. S. Nemirovski and D. B. Yudin, Problem complexity and method efficiency in optimization, 1983.

. Park, . Mijung, . Jitkrittum, . Wittawat, and D. Sejdinovic, K2-ABC: Approximate Bayesian computation with kernel embeddings, AISTATS (PMLR), pp.51398-407, 2016.

. Pfister, . Niklas, . Bühlmann, . Peter, . Schölkopf et al., Kernel-based tests for joint independence, Journal of the Royal Statistical Society: Series B (Statistical Methodology), vol.27, issue.1, pp.2017-1467
DOI : 10.1214/12-AOS1041

URL : http://onlinelibrary.wiley.com/doi/10.1111/rssb.12235/pdf

B. Schölkopf and A. J. Smola, Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond, 2002.

B. Schölkopf, . Muandet, . Krikamol, . Fukumizu, . Kenji et al., Computing functions of random variables via reproducing kernel Hilbert space representations, Statistics and Computing, vol.2, issue.3, pp.755-766, 2015.
DOI : 10.1137/0114046

D. Sejdinovic, . Sriperumbudur, K. Bharath, . Gretton, . Arthur et al., Equivalence of distance-based and RKHS-based statistics in hypothesis testing, The Annals of Statistics, vol.41, issue.5, pp.2263-2291, 2013.
DOI : 10.1214/13-AOS1140

J. Shawe-taylor and N. Cristianini, Kernel Methods for Pattern Analysis, 2004.
DOI : 10.1017/CBO9780511809682

B. Sinova, . Gonzlez-rodríguez, . Gil, S. Aelst, and . Van, M-estimators of location for functional data, Bernoulli, vol.24, issue.3, pp.2328-2357, 2018.
DOI : 10.3150/17-BEJ929

A. Smola, . Gretton, . Arthur, . Song, . Le et al., A Hilbert space embedding for distributions, ALT, pp.13-31, 2007.

L. Song, . Gretton, . Arthur, . Bickson, . Danny et al., Kernel belief propagation, AISTATS, pp.707-715, 2011.

. Sriperumbudur, K. Bharath, . Gretton, . Arthur, . Fukumizu et al., Hilbert space embeddings and metrics on probability measures, Journal of Machine Learning Research, vol.11, pp.1517-1561, 2010.

I. Steinwart and A. Christmann, Support Vector Machines, 2008.

. Szabó, . Zoltán, . Sriperumbudur, . Bharath, . Póczos et al., Learning theory for distribution regression, Journal of Machine Learning Research, vol.17, issue.152, pp.1-40, 2016.

G. J. Székely and M. L. Rizzo, Testing for equal distributions in high dimension, InterStat, vol.5, 2004.

G. J. Székely and M. L. Rizzo, A new test for multivariate normality, Journal of Multivariate Analysis, vol.93, issue.1, pp.58-80, 2005.
DOI : 10.1016/j.jmva.2003.12.002

. Tolstikhin, . Ilya, . Sriperumbudur, K. Bharath, and . Muandet, Minimax estimation of kernel mean embeddings, Journal of Machine Learning Research, vol.18, pp.1-47, 2017.

S. V. Vishwanathan, . Schraudolph, N. Nicol, . Kondor, . Risi et al., Graph kernels, Journal of Machine Learning Research, vol.11, pp.1201-1242, 2010.

M. Yamada, . Umezu, . Yuta, . Fukumizu, . Kenji et al., Post selection inference with kernels

. Zaheer, . Manzil, . Kottur, . Satwik, . Ravanbakhsh et al., Deep sets, NIPS, pp.3394-3404, 2017.

. Zhang, . Kun, . Schölkopf, . Bernhard, . Muandet et al., Domain adaptation under target and conditional shift, Journal of Machine Learning Research, vol.28, issue.3, pp.819-827, 2013.

A. A. Zinger, A. V. Kakosyan, and L. B. Klebanov, A characterization of distributions by mean values of statistics and certain probabilistic metrics, Journal of Soviet Mathematics, vol.200, issue.No. 4, 1992.
DOI : 10.1007/BF01099119

V. M. Zolotarev, Probability metrics. Theory of Probability and its Applications, pp.278-302, 1983.