Формализация квантовоподобных концептуальных полей

leventov · 06.Февраль.2024 10:32:00

Abstract:

In this article we present a new modelling framework for structured concepts using a category-theoretic generalisation of conceptual spaces, and show how the conceptual representations can be learned automatically from data, using two very different instantiations: one classical and one quantum. A contribution of the work is a thorough category-theoretic formalisation of our framework. We claim that the use of category theory, and in particular the use of string diagrams to describe quantum processes, helps elucidate some of the most important features of our approach. We build upon Gardenfors’ classical framework of conceptual spaces, in which cognition is modelled geometrically through the use of convex spaces, which in turn factorise in terms of simpler spaces called domains. We show how concepts from the domains of shape, colour, size and position can be learned from images of simple shapes, where concepts are represented as Gaussians in the classical implementation, and quantum effects in the quantum one. In the classical case we develop a new model which is inspired by the Beta-VAE model of concepts, but is designed to be more closely connected with language, so that the names of concepts form part of the graphical model. In the quantum case, concepts are learned by a hybrid classical-quantum network trained to perform concept classification, where the classical image processing is carried out by a convolutional neural network and the quantum representations are produced by a parameterised quantum circuit. Finally, we consider the question of whether our quantum models of concepts can be considered conceptual spaces in the Gardenfors sense.

Из вступления:

In this article we present a new modelling framework for concepts based on the mathematical formalism used in quantum theory, and demonstrate how the conceptual representations can be learned automatically from data, using both classical and quantum-inspired models. A contribution of the work is a thorough category-theoretic formalisation of our framework, following Bolt et al. (2019) and Tull (2021). Formalisation of conceptual models is not new (Ganter & Obiedkov, 2016), but we claim that the use of category theory (Fong, 2019), and in particular the use of string diagrams to describe quantum processes (Coecke & Kissinger, 2017), helps elucidate some of the most important features of our approach to concept modelling. This aspect of our work also fits with the recent push to introduce category theory into machine learning and AI more broadly. The motivation is to make deep learning less ad-hoc and less driven by heuristics, by viewing deep learning models through the compositional lens of category theory (Shiebler et al., 2021).

Murphy (2002, p.1) describes concepts as “the glue that holds our mental world together”. But how should concepts be modelled and represented mathematically? There are many modelling frameworks in the literature, including the classical theory (Margolis & Laurence, 2022), the prototype theory (Rosch, 1973), and the theory theory (Gopnik & Meltzoff, 1997). Here we build upon G¨ardenfors’ framework of conceptual spaces (G¨ardenfors, 2004, 2014), in which cognition is modelled geometrically through the use of convex spaces, which in turn factorise in terms of simpler spaces called domains.

Our category-theoretic formalisation of conceptual spaces allows flexibility in how the framework is instantiated and then implemented, with the particular instantiation determined by the choice of category. First we show how the framework can be instantiated and implemented classically, by using the formalisation of “fuzzy” conceptual spaces from Tull (2021), and developing a probabilistic model based on Variational Autoencoders (VAEs) (Rezende et al., 2014; Kingma & Welling, 2014). Having “fuzzy” probabilistic representations not only extends G¨ardenfors’ framework in a useful way, it also provides a natural mechanism for dealing with the vagueness inherent in the human conceptual system, and allows us to draw on the toolkit from machine learning to provide effective learning mechanisms. Our new model—which we call the Conceptual VAE—is an extension of the β-VAE from Higgins et al. (2017), with the concepts having explicit labels and represented as multivariate Gaussians in a factored conceptual space.

ushelspat · 05.Март.2024 15:40:44

Вот, кстати, из Системного мышления цитата:
Это «попсовое» понимание слова «система» было унаследовано и современными системами AI (Bard, Claude, ChatGPT и т.д.). Увы, с ними нельзя поддержать разговор про современное системное мышление: в бытовом понимании слово «система» не тип объекта, как в системном подходе, а «почти синоним» объекта. Вместо теоретической теории понятий и её строгой/формальной типизации (медленное мышление S2 по Канеману32) в системах искусственного интеллекта используется прототипная теория понятий с её нестрогими аналогиями как основой мышления (быстрое мышление S1 по Канеману), причём не из современного системного подхода, а из обыденной речи. Поэтому осторожней беседуйте с системами AI (а также с обывателями) на тему системного мышления, слово «система» знают все — но для них всех это не тип объекта из системного подхода, и не все могут отследить строгость использования типа!

Т.е. опять из данных выделение концептов и доменов vs построение концептов/объяснений из догадок и их заземление/проверка.

advat · 05.Март.2024 17:12:05

Всё ровно так как Вы описываете. В качестве заплатки, со многими-многими оговорками, чат gpt 3.5 на платформе Aisystant справляется за счёт того, что выдаёт ссылки на источники.

Далее, если пройти по ссылкам и собрать в качестве датасета нужные по теме статьи, а потом их подгрузить в чат gpt 4, то, при наличии соответствующего промта, качество ответов нейросетки можно повысить. Но, конечно же, окончательно проблема ещё не решена.

На лаборатории ИИ ШСМ часто высказывается мнение, что современные и доступные ШСМ нейросетки ещё очень слабы для ведения “содержательных бесед”. Как гипотеза, сейчас рассматривается вариант, что если сетка по результатам тестов препода, будет ошибаться много меньше, чем студент, то такую нейросетку уже можно рекомендовать студентам, опять же с много-много оговорками, в качестве тьютора 24/7. Но для этого, вероятно, следует дождаться того момента, когда новое поколение нейросетей станет столь же доступным, как и чат gpt 3.5 сейчас.