Abstract:
We introduce a new statistical model for time series that iteratively segments data into regimes with approximately linear dynamics and learns the parameters of each of these linear regimes. This model combines and generalizes two of the most widely used stochastic time-series models, hidden Markov models and linear dynamical systems, and is closely related to models that are widely used in the control and econometrics literatures.
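As a hedged illustration of the generative process such a switching model describes (a hidden Markov chain over regimes, with linear-Gaussian dynamics within each regime), here is a minimal NumPy sketch; the two regimes, transition matrix, and noise scales below are invented for illustration, not taken from the paper.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical 2-regime switching linear dynamics (parameters are illustrative).
A = [np.array([[0.99]]), np.array([[0.7]])]   # per-regime dynamics matrices
P = np.array([[0.95, 0.05],                   # regime transition probabilities
              [0.10, 0.90]])
q, r = 0.1, 0.2                               # process / observation noise std

def sample(T=200):
    z, x = 0, np.zeros(1)
    zs, ys = [], []
    for _ in range(T):
        z = rng.choice(2, p=P[z])                    # discrete regime: hidden Markov chain
        x = A[z] @ x + q * rng.standard_normal(1)    # linear dynamics of the active regime
        ys.append(x[0] + r * rng.standard_normal())  # noisy observation
        zs.append(z)
    return np.array(zs), np.array(ys)

zs, ys = sample()
```

Fitting such a model means recovering both the segmentation `zs` and the per-regime parameters `A` from `ys` alone, which is the learning problem the abstract describes.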
Abstract:
We study sampling as optimization in the space of measures. We focus on gradient flow-based optimization, with the Langevin dynamics as a case study. We investigate the source of the bias of the unadjusted Langevin algorithm (ULA) in discrete time, and consider how to remove or reduce the bias. We point out that the difficulty is that the heat flow is exactly solvable, but neither its forward nor its backward method is implementable in general, except for Gaussian data. We propose the symmetrized Langevin algorithm (SLA), which should have a smaller bias than ULA, at the price of implementing a proximal gradient step in space. We show that SLA is in fact consistent for a Gaussian target measure, whereas ULA is not. We also illustrate various algorithms explicitly for a Gaussian target measure with Gaussian data, including gradient descent, proximal gradient, and Forward-Backward, and show that they are all consistent.
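To make the discrete-time bias concrete: for a target $\pi \propto e^{-f}$, one ULA step is $x \leftarrow x - \eta \nabla f(x) + \sqrt{2\eta}\,\xi$ with $\xi \sim N(0,1)$. The sketch below uses an assumed 1-D Gaussian target $N(0, \sigma^2)$ (so $\nabla f(x) = x/\sigma^2$); the step size and target are illustrative choices of ours. For this target the chain's stationary variance can be computed in closed form and differs from $\sigma^2$ at order $\eta$, which is exactly the kind of bias the abstract investigates (the paper's SLA correction is not reproduced here).

```python
import numpy as np

rng = np.random.default_rng(1)

# Unadjusted Langevin algorithm (ULA) for a 1-D Gaussian target N(0, sigma^2),
# i.e. f(x) = x^2 / (2 sigma^2) and grad f(x) = x / sigma^2.
sigma2, eta, n_steps = 1.0, 0.1, 200_000

x = 0.0
samples = np.empty(n_steps)
for k in range(n_steps):
    x = x - eta * (x / sigma2) + np.sqrt(2 * eta) * rng.standard_normal()
    samples[k] = x

# For this Gaussian target the ULA iterates are Gaussian with stationary
# variance sigma^2 / (1 - eta / (2 sigma^2)), so the empirical variance
# exceeds sigma^2 by an O(eta) bias that vanishes only as eta -> 0.
print(samples[1000:].var())  # ~1.05 rather than 1.0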
Abstract:
Inference problems with conjectured statistical-computational gaps are ubiquitous throughout modern statistics, computer science, statistical physics and discrete probability. While there has been success evidencing these gaps from the failure of restricted classes of algorithms, progress towards a more traditional reduction-based approach to computational complexity in statistical inference has been limited. These average-case problems are each tied to a different natural distribution, high-dimensional structure and conjecturally hard parameter regime, leaving reductions among them technically challenging. Despite a flurry of recent success in developing such techniques, existing reductions have largely been limited to inference problems with similar structure – primarily mapping among problems representable as a sparse submatrix signal plus a noise matrix, a structure shared by the common starting hardness assumption of planted clique ($\textsc{pc}$). The insight in this work is that a slight generalization of the planted clique conjecture – secret leakage planted clique ($\textsc{pc}_\rho$), wherein a small amount of information about the hidden clique is revealed – gives rise to a variety of new average-case reduction techniques, yielding a web of reductions relating statistical problems with very different structure. Based on generalizations of the planted clique conjecture to specific forms of $\textsc{pc}_\rho$, we deduce tight statistical-computational tradeoffs for a diverse range of problems including robust sparse mean estimation, mixtures of sparse linear regressions, robust sparse linear regression, tensor PCA, variants of dense $k$-block stochastic block models, negatively correlated sparse PCA, semirandom planted dense subgraph, detection in hidden partition models and a universality principle for learning sparse mixtures. This gives the first reduction-based evidence for a number of conjectured statistical-computational gaps. We introduce a number of new average-case reduction techniques that also reveal novel connections to combinatorial designs based on the incidence geometry of $\mathbb{F}_r^t$ and to random matrix theory. In particular, we show a convergence result between Wishart and inverse Wishart matrices that may be of independent interest. The specific hardness conjectures for $\textsc{pc}_\rho$ implying our statistical-computational gaps are all in correspondence with natural graph problems such as $k$-partite, bipartite and hypergraph variants of $\textsc{pc}$. Hardness in a $k$-partite hypergraph variant of $\textsc{pc}$ is the strongest of these conjectures and is sufficient to establish all of our computational lower bounds. We also give evidence for our $\textsc{pc}_\rho$ hardness conjectures from the failure of low-degree polynomials and statistical query algorithms. Our work raises a number of open problems and suggests that previous technical obstacles to average-case reductions may have arisen because planted clique is not the right starting point. An expanded set of hardness assumptions, such as $\textsc{pc}_\rho$, may be a key first step towards a more complete theory of reductions among statistical problems.
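For readers less familiar with the starting assumption: a standard planted clique instance is an Erdős-Rényi graph $G(n, 1/2)$ with all edges added among a hidden set of $k$ vertices, and detection means distinguishing this from a plain $G(n, 1/2)$ sample. A minimal sketch of that instance distribution follows; it covers only vanilla $\textsc{pc}$, not the secret-leakage variant $\textsc{pc}_\rho$, whose leakage structure varies across the paper's conjectures.

```python
import numpy as np

def planted_clique(n, k, rng):
    """Adjacency matrix of G(n, 1/2) with a clique planted on k random vertices."""
    upper = rng.random((n, n)) < 0.5          # i.i.d. fair-coin edges
    adj = np.triu(upper, 1)
    adj = adj | adj.T                         # symmetrize; no self-loops
    clique = rng.choice(n, size=k, replace=False)
    adj[np.ix_(clique, clique)] = True        # connect every pair inside the clique
    np.fill_diagonal(adj, False)
    return adj, clique

rng = np.random.default_rng(2)
adj, clique = planted_clique(n=500, k=30, rng=rng)
```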
Abstract:
Social media platforms such as Twitter, Facebook, and Weibo are being increasingly embraced by individuals, groups, and organizations as a valuable source of information. This social-media-generated information comes in the form of tweets or posts and is normally characterized as short, voluminous, sparse, and low-density text. Since many real-world applications need semantic interpretation of such short texts, research in Short Text Topic Modeling (STTM) has recently gained a lot of interest in order to reveal unique and cohesive latent topics. This article examines the current state of the art in STTM algorithms. It presents a comprehensive survey and taxonomy of STTM algorithms for short text topic modeling. The article also includes a qualitative and quantitative study of the STTM algorithms, as well as analyses of the various strengths and drawbacks of STTM techniques. Moreover, a comparative analysis of the topic quality and performance of representative STTM models is presented. The performance evaluation is conducted on two real-world Twitter datasets, the Real-World Pandemic Twitter (RW-Pand-Twitter) dataset and the Real-World Cyberbullying Twitter (RW-CB-Twitter) dataset, in terms of several metrics such as topic coherence, purity, NMI, and accuracy. Finally, the open challenges and future research directions in this promising field are discussed to highlight the trends of research in STTM. The work presented in this paper is useful both for researchers interested in learning the state of the art of short text topic modeling and for researchers focusing on developing new algorithms for short text topic modeling.
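As a hedged illustration of two of the cluster-quality metrics such evaluations rely on, purity and NMI can be computed from predicted topic assignments and gold labels as below. scikit-learn's `normalized_mutual_info_score` exists with this signature; the `purity` helper and the toy labels are our own, not the article's evaluation code.

```python
import numpy as np
from sklearn.metrics import confusion_matrix, normalized_mutual_info_score

def purity(y_true, y_pred):
    """Fraction of documents whose cluster's majority gold label matches theirs."""
    cm = confusion_matrix(y_true, y_pred)     # rows: gold labels, cols: clusters
    return cm.max(axis=0).sum() / cm.sum()

# Toy assignments standing in for STTM output on labeled tweets.
y_true = np.array([0, 0, 0, 1, 1, 1, 2, 2])
y_pred = np.array([1, 1, 1, 0, 0, 2, 2, 2])

print("purity:", purity(y_true, y_pred))                    # 0.875
print("NMI:", normalized_mutual_info_score(y_true, y_pred))
```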
Abstract:
In the field of natural language processing and text mining, sentiment analysis (SA) has received huge attention from researchers across the globe. With the prevalence of Web 2.0, users have become more eager to share, promote, and express themselves, along with any issues or challenges encountered in daily activities, through the Internet (social media, micro-blogs, e-commerce, etc.). Expressions and opinions are a complex sequence of acts that convey a huge volume of data, posing a challenge for computational researchers to decode. Over time, researchers from various segments of the public and private sectors have been involved in the exploration of SA, with the aim of understanding the behavioral perspective of various stakeholders in society. Though the efforts to advance SA have been successful, challenges to its efficiency still prevail. This article presents an organized survey of SA (also known as opinion mining) along with its methodologies and algorithms. The survey classifies SA into categories based on levels, tasks, and sub-tasks, along with the various techniques used for performing them. The survey explicitly focuses on the different directions in which research has been explored in the area of cross-domain opinion classification. The article concludes with an exclusive and exhaustive analysis of the area of opinion mining, covering the approaches, datasets, languages, and applications used. The observations made are expected to help researchers gain a greater understanding of emerging trends and of the state-of-the-art methods to be applied in future exploration.
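To ground the document-level task the survey's taxonomy starts from, here is a toy sketch of the simplest family of techniques such surveys cover, a lexicon-based polarity classifier; the tiny word list is invented for illustration and is not a method from the article.

```python
# Purely illustrative sentiment lexicon (made up for this sketch).
LEXICON = {"good": 1, "great": 1, "love": 1, "bad": -1, "awful": -1, "hate": -1}

def polarity(text: str) -> str:
    """Document-level polarity: sum word scores and threshold at zero."""
    score = sum(LEXICON.get(w, 0) for w in text.lower().split())
    return "positive" if score > 0 else "negative" if score < 0 else "neutral"

print(polarity("I love this phone, great battery"))  # positive
print(polarity("awful service, I hate waiting"))     # negative
```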
Abstract:
While the concept of swarm intelligence was introduced in the 1980s, the first swarm optimisation algorithm was introduced a decade later, in 1992. In this paper, nineteen representative original swarm optimisation algorithms are analysed to extract their common features and design a taxonomy for swarm optimisation. We use twenty-nine benchmark problems to compare the performance of these nineteen algorithms, in the form in which they were first introduced in the literature, against five state-of-the-art swarm algorithms. This comparison reveals the advancements made in this field over three decades. It shows that, while the state-of-the-art swarm optimisation algorithms are indeed competitive in terms of the quality of the solutions they find, they have evolved to be more computationally demanding than the nineteen original swarm optimisation algorithms. The investigation suggests that there is a pressing need to continue to design swarm optimisation algorithms that are simpler, while maintaining their current competitive performance.
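As a hedged sketch of the kind of algorithm being compared, here is a minimal particle swarm optimisation (one of the early swarm algorithms, Kennedy and Eberhart 1995) on the sphere benchmark. The inertia and acceleration coefficients are common textbook defaults, and the dimensions and budgets are our own choices, not the survey's experimental settings.

```python
import numpy as np

rng = np.random.default_rng(3)

def sphere(x):
    return np.sum(x * x, axis=-1)   # classic benchmark; minimum 0 at the origin

def pso(f, dim=10, n_particles=30, iters=500, w=0.729, c1=1.49, c2=1.49):
    """Minimal particle swarm optimisation with an inertia weight."""
    x = rng.uniform(-5, 5, (n_particles, dim))
    v = np.zeros_like(x)
    pbest, pbest_val = x.copy(), f(x)            # personal bests
    gbest = pbest[pbest_val.argmin()].copy()     # global best
    for _ in range(iters):
        r1, r2 = rng.random((2, n_particles, dim))
        v = w * v + c1 * r1 * (pbest - x) + c2 * r2 * (gbest - x)
        x = x + v
        val = f(x)
        better = val < pbest_val
        pbest[better], pbest_val[better] = x[better], val[better]
        gbest = pbest[pbest_val.argmin()].copy()
    return gbest, f(gbest)

best, best_val = pso(sphere)
print(best_val)  # should be close to 0
```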
Abstract:
Recently, research in unsupervised learning has gravitated towards exploring statistical-computational gaps induced by sparsity. A line of work initiated in Berthet and Rigollet (2013) has aimed to explain these gaps through reductions to conjecturally hard problems from complexity theory. However, the delicate nature of average-case reductions has limited the development of techniques and often led to weaker hardness results that only apply to algorithms robust to different noise distributions or that do not need to know the parameters of the problem. We introduce several new techniques to give a web of average-case reductions showing strong computational lower bounds based on the planted clique conjecture. Our new lower bounds include:

- Planted Independent Set: We show tight lower bounds for detecting a planted independent set of size $k$ in a sparse Erdős-Rényi graph of size $n$ with edge density $\tilde{\Theta}(n^{-\alpha})$.
- Planted Dense Subgraph: If $p > q$ are the edge densities inside and outside of the community, we show the first lower bounds for the general regime $q = \tilde{\Theta}(n^{-\alpha})$ and $p - q = \tilde{\Theta}(n^{-\gamma})$ where $\gamma \ge \alpha$, matching the lower bounds predicted in Chen and Xu (2016). Our lower bounds apply to a deterministic community size $k$, resolving a question raised in Hajek et al. (2015).
- Biclustering: We show strong lower bounds for Gaussian biclustering as a simple hypothesis testing problem to detect a uniformly at random planted flat $k \times k$ submatrix.
- Sparse Rank-1 Submatrix: We show that detection in the sparse spiked Wigner model is often harder than biclustering, and are able to obtain two different tight lower bounds for these problems with different reductions from planted clique.
- Sparse PCA: We give a reduction between rank-1 submatrix and sparse PCA to obtain tight lower bounds in the less sparse regime $k \gg \sqrt{n}$, when the spectral algorithm is optimal over the SDP. We give an alternate reduction recovering the lower bounds of Berthet and Rigollet (2013) and Gao et al. (2017) in the simple hypothesis testing variant of sparse PCA. We also observe a subtlety in the complexity of sparse PCA that arises when the planted vector is biased.
- Subgraph Stochastic Block Model: We introduce a model where two small communities are planted in an Erdős-Rényi graph of the same average edge density and give tight lower bounds yielding different hard regimes than planted dense subgraph.

Our results demonstrate that, despite the delicate nature of average-case reductions, using natural problems as intermediates can often be beneficial, as is the case in worst-case complexity. Our main technical contribution is to introduce a set of techniques for average-case reductions that: (1) maintain the level of signal in an instance of a problem; (2) alter its planted structure; and (3) map two initial high-dimensional distributions simultaneously to two target distributions approximately under total variation. We also give algorithms matching our lower bounds and identify the information-theoretic limits of the models we consider.
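To make the biclustering hypothesis testing problem above concrete: under the alternative, a flat mean shift $\mu$ is added on a uniformly random $k \times k$ submatrix of an $n \times n$ Gaussian noise matrix, and the task is to distinguish this from pure noise. A minimal sketch of the two hypotheses follows; the parameter values are illustrative.

```python
import numpy as np

rng = np.random.default_rng(4)

def biclustering_instance(n, k, mu, planted, rng):
    """H0: i.i.d. N(0,1) matrix; H1: the same plus mu on a random k x k submatrix."""
    X = rng.standard_normal((n, n))
    if planted:
        rows = rng.choice(n, size=k, replace=False)
        cols = rng.choice(n, size=k, replace=False)
        X[np.ix_(rows, cols)] += mu   # flat planted signal on the hidden block
    return X

X0 = biclustering_instance(n=200, k=20, mu=0.5, planted=False, rng=rng)
X1 = biclustering_instance(n=200, k=20, mu=0.5, planted=True, rng=rng)
```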
Abstract:
We consider the problem of controlling a possibly unknown linear dynamical system with adversarial perturbations, adversarially chosen convex loss functions, and partially observed states, known as non-stochastic control. We introduce a controller parametrization based on the denoised observations, and prove that applying online gradient descent to this parametrization yields a new controller which attains sublinear regret against a large class of closed-loop policies. In the fully adversarial setting, our controller attains an optimal regret bound of $\sqrt{T}$ when the system is known, and, when combined with an initial stage of least-squares estimation, $T^{2/3}$ when the system is unknown; both yield the first sublinear regret bounds for the partially observed setting. Our bounds are the first in the non-stochastic control setting that compete with \emph{all} stabilizing linear dynamical controllers, not just state feedback. Moreover, in the presence of semi-adversarial noise containing both stochastic and adversarial components, our controller attains the optimal regret bounds of $\mathrm{poly}(\log T)$ when the system is known, and $\sqrt{T}$ when it is unknown. To our knowledge, this gives the first end-to-end $\sqrt{T}$ regret bound for the online Linear Quadratic Gaussian (LQG) control problem, and applies in a more general setting with adversarial losses and semi-adversarial noise.
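As a hedged sketch of the overall shape of such controllers in the simpler known-system, fully observed case: a disturbance-action controller plays $u_t = \sum_i M_i w_{t-i}$ over past perturbations recovered from the dynamics, and updates $M$ by online gradient descent on the observed loss. Everything below (the 1-D system, loss, horizon, step size, and the simplified state-frozen gradient) is invented for illustration and omits the paper's denoised-observation parametrization for partial observability.

```python
import numpy as np

rng = np.random.default_rng(5)

# Illustrative known, stable 1-D system x' = a x + b u + w with quadratic loss.
a, b, m, eta, T = 0.9, 1.0, 5, 0.01, 2000

M = np.zeros(m)        # disturbance-action controller weights
w_hist = np.zeros(m)   # most recent perturbations, newest first
x = 0.0
total_loss = 0.0
for t in range(T):
    u = M @ w_hist                      # u_t = sum_i M_i w_{t-i}
    w = 0.1 * rng.standard_normal()     # perturbation (stochastic in this toy)
    x_next = a * x + b * u + w
    total_loss += x_next**2 + u**2
    # Online gradient step on the instantaneous loss w.r.t. M, holding the
    # current state fixed (a simplification of the memory-aware gradients
    # used in this line of work): d/dM (x'^2 + u^2) = 2 (x' b + u) w_hist.
    M -= eta * 2 * (x_next * b + u) * w_hist
    # Recover w exactly since (a, b) are known, then shift the history.
    w_hist = np.roll(w_hist, 1)
    w_hist[0] = x_next - a * x - b * u  # equals w
    x = x_next

print(total_loss / T)  # average cost under the learned controller
```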