
Showing 1–34 of 34 results for author: Bordes, A

  1. arXiv:2011.05448  [pdf, other]

    cs.CL

    Generating Fact Checking Briefs

    Authors: Angela Fan, Aleksandra Piktus, Fabio Petroni, Guillaume Wenzek, Marzieh Saeidi, Andreas Vlachos, Antoine Bordes, Sebastian Riedel

    Abstract: Fact checking at scale is difficult -- while the number of active fact checking websites is growing, it remains too small for the needs of the contemporary media ecosystem. However, despite good intentions, contributions from volunteers are often error-prone, and thus in practice restricted to claim detection. We investigate how to increase the accuracy and efficiency of fact checking by providing…

    Submitted 10 November, 2020; originally announced November 2020.

  2. arXiv:2006.12442  [pdf, other]

    cs.CL cs.AI

    Open-Domain Conversational Agents: Current Progress, Open Problems, and Future Directions

    Authors: Stephen Roller, Y-Lan Boureau, Jason Weston, Antoine Bordes, Emily Dinan, Angela Fan, David Gunning, Da Ju, Margaret Li, Spencer Poff, Pratik Ringshia, Kurt Shuster, Eric Michael Smith, Arthur Szlam, Jack Urbanek, Mary Williamson

    Abstract: We present our view of what is necessary to build an engaging open-domain conversational agent: covering the qualities of such an agent, the pieces of the puzzle that have been built so far, and the gaping holes we have not filled yet. We present a biased view, focusing on work done by our own group, while citing related work in each area. In particular, we discuss in detail the properties of cont…

    Submitted 13 July, 2020; v1 submitted 22 June, 2020; originally announced June 2020.

  3. arXiv:2005.00481  [pdf, other]

    cs.CL

    ASSET: A Dataset for Tuning and Evaluation of Sentence Simplification Models with Multiple Rewriting Transformations

    Authors: Fernando Alva-Manchego, Louis Martin, Antoine Bordes, Carolina Scarton, Benoît Sagot, Lucia Specia

    Abstract: In order to simplify a sentence, human editors perform multiple rewriting transformations: they split it into several shorter sentences, paraphrase words (i.e. replacing complex words or phrases by simpler synonyms), reorder components, and/or delete information deemed unnecessary. Despite this varied range of possible text alterations, current models for automatic sentence simplification are eva…

    Submitted 1 May, 2020; originally announced May 2020.

    Comments: Accepted to ACL 2020 (camera-ready version)

  4. arXiv:2005.00352  [pdf, other]

    cs.CL cs.LG

    MUSS: Multilingual Unsupervised Sentence Simplification by Mining Paraphrases

    Authors: Louis Martin, Angela Fan, Éric de la Clergerie, Antoine Bordes, Benoît Sagot

    Abstract: Progress in sentence simplification has been hindered by a lack of labeled parallel simplification data, particularly in languages other than English. We introduce MUSS, a Multilingual Unsupervised Sentence Simplification system that does not require labeled simplification data. MUSS uses a novel approach to sentence simplification that trains strong models using sentence-level paraphrase data ins…

    Submitted 16 April, 2021; v1 submitted 1 May, 2020; originally announced May 2020.

  5. arXiv:2004.12744  [pdf, other]

    cs.CL

    Augmenting Transformers with KNN-Based Composite Memory for Dialogue

    Authors: Angela Fan, Claire Gardent, Chloe Braud, Antoine Bordes

    Abstract: Various machine learning tasks can benefit from access to external information of different modalities, such as text and images. Recent work has focused on learning architectures with large memories capable of storing this knowledge. We propose augmenting generative Transformer neural networks with KNN-based Information Fetching (KIF) modules. Each KIF module learns a read operation to access fixe…

    Submitted 10 November, 2020; v1 submitted 27 April, 2020; originally announced April 2020.

  6. arXiv:1910.08435  [pdf, other]

    cs.CL

    Using Local Knowledge Graph Construction to Scale Seq2Seq Models to Multi-Document Inputs

    Authors: Angela Fan, Claire Gardent, Chloe Braud, Antoine Bordes

    Abstract: Query-based open-domain NLP tasks require information synthesis from long and diverse web results. Current approaches extractively select portions of web text as input to Sequence-to-Sequence models using methods such as TF-IDF ranking. We propose constructing a local graph structured knowledge base for each query, which compresses the web search information and reduces redundancy. We show that by…

    Submitted 18 October, 2019; originally announced October 2019.

  7. arXiv:1910.02677  [pdf, other]

    cs.CL

    Controllable Sentence Simplification

    Authors: Louis Martin, Benoît Sagot, Éric de la Clergerie, Antoine Bordes

    Abstract: Text simplification aims at making a text easier to read and understand by simplifying grammar and structure while keeping the underlying information identical. It is often considered an all-purpose generic task where the same simplification is suitable for all; however multiple audiences can benefit from simplified text in different ways. We adapt a discrete parametrization mechanism that provide…

    Submitted 20 April, 2020; v1 submitted 7 October, 2019; originally announced October 2019.

    Comments: Code and models: https://github.com/facebookresearch/access

  8. arXiv:1901.10746  [pdf, other]

    cs.CL

    Reference-less Quality Estimation of Text Simplification Systems

    Authors: Louis Martin, Samuel Humeau, Pierre-Emmanuel Mazaré, Antoine Bordes, Éric Villemonte de La Clergerie, Benoît Sagot

    Abstract: The evaluation of text simplification (TS) systems remains an open challenge. As the task has common points with machine translation (MT), TS is often evaluated using MT metrics such as BLEU. However, such metrics require high quality reference data, which is rarely available for TS. TS has the advantage over MT of being a monolingual task, which allows for direct comparisons to be made between th…

    Submitted 30 January, 2019; originally announced January 2019.

    Journal ref: 1st Workshop on Automatic Text Adaptation (ATA), Nov 2018, Tilburg, Netherlands. https://www.ida.liu.se/~evere22/ATA-18/

  9. arXiv:1901.05415  [pdf, other]

    cs.CL cs.AI cs.HC cs.LG stat.ML

    Learning from Dialogue after Deployment: Feed Yourself, Chatbot!

    Authors: Braden Hancock, Antoine Bordes, Pierre-Emmanuel Mazaré, Jason Weston

    Abstract: The majority of conversations a dialogue agent sees over its lifetime occur after it has already been trained and deployed, leaving a vast store of potential training signal untapped. In this work, we propose the self-feeding chatbot, a dialogue agent with the ability to extract new training examples from the conversations it participates in. As our agent engages in conversation, it also estimates…

    Submitted 13 June, 2019; v1 submitted 16 January, 2019; originally announced January 2019.

    Comments: ACL 2019

  10. arXiv:1811.00945  [pdf, other]

    cs.CL

    Image Chat: Engaging Grounded Conversations

    Authors: Kurt Shuster, Samuel Humeau, Antoine Bordes, Jason Weston

    Abstract: To achieve the long-term goal of machines being able to engage humans in conversation, our models should captivate the interest of their speaking partners. Communication grounded in images, whereby a dialogue is conducted based on a given photo, is a setup naturally appealing to humans (Hu et al., 2014). In this work we study large-scale architectures and datasets for this goal. We test a set of n…

    Submitted 29 April, 2020; v1 submitted 2 November, 2018; originally announced November 2018.

    Comments: ACL 2020

  11. arXiv:1810.10665  [pdf, other]

    cs.CV cs.AI cs.CL

    Engaging Image Captioning Via Personality

    Authors: Kurt Shuster, Samuel Humeau, Hexiang Hu, Antoine Bordes, Jason Weston

    Abstract: Standard image captioning tasks such as COCO and Flickr30k are factual, neutral in tone and (to a human) state the obvious (e.g., "a man playing a guitar"). While such tasks are useful to verify that a machine understands the content of an image, they are not engaging to humans as captions. With this in mind we define a new task, Personality-Captions, where the goal is to be as engaging to humans…

    Submitted 20 March, 2019; v1 submitted 24 October, 2018; originally announced October 2018.

  12. arXiv:1809.01984  [pdf, other]

    cs.CL

    Training Millions of Personalized Dialogue Agents

    Authors: Pierre-Emmanuel Mazaré, Samuel Humeau, Martin Raison, Antoine Bordes

    Abstract: Current dialogue systems are not very engaging for users, especially when trained end-to-end without relying on proactive reengaging scripted strategies. Zhang et al. (2018) showed that the engagement level of end-to-end dialogue models increases when conditioning them on text personas providing some personalized back-story to the model. However, the dataset used in Zhang et al. (2018) is syntheti…

    Submitted 6 September, 2018; originally announced September 2018.

    Comments: EMNLP 2018

  13. arXiv:1804.10490  [pdf, other]

    cs.CL

    Weaver: Deep Co-Encoding of Questions and Documents for Machine Reading

    Authors: Martin Raison, Pierre-Emmanuel Mazaré, Rajarshi Das, Antoine Bordes

    Abstract: This paper aims at improving how machines can answer questions directly from text, with the focus of having models that can answer correctly multiple types of questions and from various types of texts, documents or even from large collections of them. To that end, we introduce the Weaver model that uses a new way to relate a question to a textual context by weaving layers of recurrent networks, wi…

    Submitted 27 April, 2018; originally announced April 2018.

  14. arXiv:1804.00921  [pdf, other]

    cs.LG stat.ML

    DeSIGN: Design Inspiration from Generative Networks

    Authors: Othman Sbai, Mohamed Elhoseiny, Antoine Bordes, Yann LeCun, Camille Couprie

    Abstract: Can an algorithm create original and compelling fashion designs to serve as an inspirational assistant? To help answer this question, we design and investigate different image generation models associated with different loss functions to boost creativity in fashion generation. The dimensions of our explorations include: (i) different Generative Adversarial Networks architectures that start from no…

    Submitted 14 September, 2018; v1 submitted 3 April, 2018; originally announced April 2018.

  15. arXiv:1709.03856  [pdf, ps, other]

    cs.CL

    StarSpace: Embed All The Things!

    Authors: Ledell Wu, Adam Fisch, Sumit Chopra, Keith Adams, Antoine Bordes, Jason Weston

    Abstract: We present StarSpace, a general-purpose neural embedding model that can solve a wide variety of problems: labeling tasks such as text classification, ranking tasks such as information retrieval/web search, collaborative filtering-based or content-based recommendation, embedding of multi-relational graphs, and learning word, sentence or document level embeddings. In each case the model works by emb…

    Submitted 20 November, 2017; v1 submitted 12 September, 2017; originally announced September 2017.

  16. arXiv:1706.00409  [pdf, other]

    cs.CV

    Fader Networks: Manipulating Images by Sliding Attributes

    Authors: Guillaume Lample, Neil Zeghidour, Nicolas Usunier, Antoine Bordes, Ludovic Denoyer, Marc'Aurelio Ranzato

    Abstract: This paper introduces a new encoder-decoder architecture that is trained to reconstruct images by disentangling the salient information of the image and the values of attributes directly in the latent space. As a result, after training, our model can generate different realistic versions of an input image by varying the attribute values. By using continuous attribute values, we can choose how much…

    Submitted 28 January, 2018; v1 submitted 1 June, 2017; originally announced June 2017.

    Comments: NIPS 2017

  17. arXiv:1705.06476  [pdf, other]

    cs.CL

    ParlAI: A Dialog Research Software Platform

    Authors: Alexander H. Miller, Will Feng, Adam Fisch, Jiasen Lu, Dhruv Batra, Antoine Bordes, Devi Parikh, Jason Weston

    Abstract: We introduce ParlAI (pronounced "par-lay"), an open-source software platform for dialog research implemented in Python, available at http://parl.ai. Its goal is to provide a unified framework for sharing, training and testing of dialog models, integration of Amazon Mechanical Turk for data collection, human evaluation, and online/reinforcement learning; and a repository of machine learning models…

    Submitted 8 March, 2018; v1 submitted 18 May, 2017; originally announced May 2017.

  18. arXiv:1705.02364  [pdf, ps, other]

    cs.CL

    Supervised Learning of Universal Sentence Representations from Natural Language Inference Data

    Authors: Alexis Conneau, Douwe Kiela, Holger Schwenk, Loic Barrault, Antoine Bordes

    Abstract: Many modern NLP systems rely on word embeddings, previously trained in an unsupervised manner on large corpora, as base features. Efforts to obtain embeddings for larger chunks of text, such as sentences, have however not been so successful. Several attempts at learning unsupervised representations of sentences have not reached satisfactory enough performance to be widely adopted. In this paper, w…

    Submitted 8 July, 2018; v1 submitted 5 May, 2017; originally announced May 2017.

    Comments: EMNLP 2017

  19. arXiv:1704.00051  [pdf, other]

    cs.CL

    Reading Wikipedia to Answer Open-Domain Questions

    Authors: Danqi Chen, Adam Fisch, Jason Weston, Antoine Bordes

    Abstract: This paper proposes to tackle open-domain question answering using Wikipedia as the unique knowledge source: the answer to any factoid question is a text span in a Wikipedia article. This task of machine reading at scale combines the challenges of document retrieval (finding the relevant articles) with that of machine comprehension of text (identifying the answer spans from those articles). Our a…

    Submitted 27 April, 2017; v1 submitted 31 March, 2017; originally announced April 2017.

    Comments: ACL 2017, 10 pages

  20. arXiv:1612.03969  [pdf, ps, other]

    cs.CL

    Tracking the World State with Recurrent Entity Networks

    Authors: Mikael Henaff, Jason Weston, Arthur Szlam, Antoine Bordes, Yann LeCun

    Abstract: We introduce a new model, the Recurrent Entity Network (EntNet). It is equipped with a dynamic long-term memory which allows it to maintain and update a representation of the state of the world as it receives new data. For language understanding tasks, it can reason on-the-fly as it reads text, not just when it is required to answer a question or respond as is the case for a Memory Network (Sukhba…

    Submitted 10 May, 2017; v1 submitted 12 December, 2016; originally announced December 2016.

    Journal ref: ICLR 2017

  21. arXiv:1606.03126  [pdf, other]

    cs.CL

    Key-Value Memory Networks for Directly Reading Documents

    Authors: Alexander Miller, Adam Fisch, Jesse Dodge, Amir-Hossein Karimi, Antoine Bordes, Jason Weston

    Abstract: Directly reading documents and being able to answer questions from them is an unsolved challenge. To avoid its inherent difficulty, question answering (QA) has been directed towards using Knowledge Bases (KBs) instead, which has proven effective. Unfortunately KBs often suffer from being too restrictive, as the schema cannot support certain types of answers, and too sparse, e.g. Wikipedia contains…

    Submitted 10 October, 2016; v1 submitted 9 June, 2016; originally announced June 2016.

  22. arXiv:1605.07683  [pdf, other]

    cs.CL

    Learning End-to-End Goal-Oriented Dialog

    Authors: Antoine Bordes, Y-Lan Boureau, Jason Weston

    Abstract: Traditional dialog systems used in goal-oriented applications require a lot of domain-specific handcrafting, which hinders scaling up to new domains. End-to-end dialog systems, in which all components are trained from the dialogs themselves, escape this limitation. But the encouraging success recently obtained in chit-chat dialog may not carry over to goal-oriented settings. This paper proposes a…

    Submitted 30 March, 2017; v1 submitted 24 May, 2016; originally announced May 2016.

    Comments: Accepted as a conference paper at ICLR 2017

  23. arXiv:1511.06931  [pdf, ps, other]

    cs.CL cs.LG

    Evaluating Prerequisite Qualities for Learning End-to-End Dialog Systems

    Authors: Jesse Dodge, Andreea Gane, Xiang Zhang, Antoine Bordes, Sumit Chopra, Alexander Miller, Arthur Szlam, Jason Weston

    Abstract: A long-term goal of machine learning is to build intelligent conversational agents. One recent popular approach is to train end-to-end models on a large amount of real dialog transcripts between humans (Sordoni et al., 2015; Vinyals & Le, 2015; Shang et al., 2015). However, this approach leaves many questions unanswered as an understanding of the precise successes and shortcomings of each model is…

    Submitted 19 April, 2016; v1 submitted 21 November, 2015; originally announced November 2015.

  24. arXiv:1511.02301  [pdf, other]

    cs.CL

    The Goldilocks Principle: Reading Children's Books with Explicit Memory Representations

    Authors: Felix Hill, Antoine Bordes, Sumit Chopra, Jason Weston

    Abstract: We introduce a new test of how well language models capture meaning in children's books. Unlike standard language modelling benchmarks, it distinguishes the task of predicting syntactic function words from that of predicting lower-frequency words, which carry greater semantic content. We compare a range of state-of-the-art models, each with a different way of encoding what has been previously read…

    Submitted 1 April, 2016; v1 submitted 6 November, 2015; originally announced November 2015.

  25. arXiv:1506.02075  [pdf, ps, other]

    cs.LG cs.CL

    Large-scale Simple Question Answering with Memory Networks

    Authors: Antoine Bordes, Nicolas Usunier, Sumit Chopra, Jason Weston

    Abstract: Training large-scale question answering systems is complicated because training sources usually cover a small portion of the range of possible questions. This paper studies the impact of multitask and transfer learning for simple question answering; a setting for which the reasoning required to answer is quite easy, as long as one can retrieve the correct evidence given a question, which can be di…

    Submitted 5 June, 2015; originally announced June 2015.

  26. arXiv:1506.00999  [pdf, ps, other]

    cs.AI cs.CL cs.LG

    Combining Two And Three-Way Embeddings Models for Link Prediction in Knowledge Bases

    Authors: Alberto Garcia-Duran, Antoine Bordes, Nicolas Usunier, Yves Grandvalet

    Abstract: This paper tackles the problem of endogenous link prediction for Knowledge Base completion. Knowledge Bases can be represented as directed graphs whose nodes correspond to entities and edges to relationships. Previous attempts either consist of powerful systems with high capacity to model complex connectivity patterns, which unfortunately usually end up overfitting on rare relationships, or in app…

    Submitted 2 June, 2015; originally announced June 2015.

    Comments: 26 pages

  27. arXiv:1502.05698  [pdf, ps, other]

    cs.AI cs.CL stat.ML

    Towards AI-Complete Question Answering: A Set of Prerequisite Toy Tasks

    Authors: Jason Weston, Antoine Bordes, Sumit Chopra, Alexander M. Rush, Bart van Merriënboer, Armand Joulin, Tomas Mikolov

    Abstract: One long-term goal of machine learning research is to produce methods that are applicable to reasoning and natural language, in particular building an intelligent dialogue agent. To measure progress towards that goal, we argue for the usefulness of a set of proxy tasks that evaluate reading comprehension via question answering. Our tasks measure understanding in several ways: whether a system is a…

    Submitted 31 December, 2015; v1 submitted 19 February, 2015; originally announced February 2015.

  28. arXiv:1410.3916  [pdf, ps, other]

    cs.AI cs.CL stat.ML

    Memory Networks

    Authors: Jason Weston, Sumit Chopra, Antoine Bordes

    Abstract: We describe a new class of learning models called memory networks. Memory networks reason with inference components combined with a long-term memory component; they learn how to use these jointly. The long-term memory can be read and written to, with the goal of using it for prediction. We investigate these models in the context of question answering (QA) where the long-term memory effectively act…

    Submitted 29 November, 2015; v1 submitted 14 October, 2014; originally announced October 2014.

  29. arXiv:1406.3676  [pdf, other]

    cs.CL

    Question Answering with Subgraph Embeddings

    Authors: Antoine Bordes, Sumit Chopra, Jason Weston

    Abstract: This paper presents a system which learns to answer questions on a broad range of topics from a knowledge base using few hand-crafted features. Our model learns low-dimensional embeddings of words and knowledge base constituents; these representations are used to score natural language questions against candidate answers. Training our system using pairs of questions and structured representations…

    Submitted 3 September, 2014; v1 submitted 13 June, 2014; originally announced June 2014.

  30. arXiv:1404.4326  [pdf, other]

    cs.CL cs.LG

    Open Question Answering with Weakly Supervised Embedding Models

    Authors: Antoine Bordes, Jason Weston, Nicolas Usunier

    Abstract: Building computers able to answer questions on any subject is a long standing goal of artificial intelligence. Promising progress has recently been achieved by methods that learn to map questions to logical forms or database queries. Such approaches can be effective but at the cost of either large amounts of human-labeled data or by defining lexicons and grammars tailored by practitioners. In this…

    Submitted 16 April, 2014; originally announced April 2014.

  31. arXiv:1307.7973  [pdf, ps, other]

    cs.CL cs.IR cs.LG

    Connecting Language and Knowledge Bases with Embedding Models for Relation Extraction

    Authors: Jason Weston, Antoine Bordes, Oksana Yakhnenko, Nicolas Usunier

    Abstract: This paper proposes a novel approach for relation extraction from free text which is trained to jointly use information from the text and from existing knowledge. Our model is based on two scoring functions that operate by learning low-dimensional embeddings of words and of entities and relationships from a knowledge base. We empirically show on New York Times articles aligned with Freebase relati…

    Submitted 30 July, 2013; originally announced July 2013.

  32. arXiv:1304.7158  [pdf, ps, other]

    cs.LG

    Irreflexive and Hierarchical Relations as Translations

    Authors: Antoine Bordes, Nicolas Usunier, Alberto Garcia-Duran, Jason Weston, Oksana Yakhnenko

    Abstract: We consider the problem of embedding entities and relations of knowledge bases in low-dimensional vector spaces. Unlike most existing approaches, which are primarily efficient for modeling equivalence relations, our approach is designed to explicitly model irreflexive relations, such as hierarchies, by interpreting them as translations operating on the low-dimensional embeddings of the entities. P…

    Submitted 26 April, 2013; originally announced April 2013.

    Comments: Submitted at the ICML 2013 workshop "Structured Learning: Inferring Graphs from Structured and Unstructured Inputs"

  33. arXiv:1301.3485  [pdf, ps, other]

    cs.LG

    A Semantic Matching Energy Function for Learning with Multi-relational Data

    Authors: Xavier Glorot, Antoine Bordes, Jason Weston, Yoshua Bengio

    Abstract: Large-scale relational learning becomes crucial for handling the huge amounts of structured data generated daily in many application domains ranging from computational biology or information retrieval, to natural language processing. In this paper, we present a new neural network architecture designed to embed multi-relational graphs into a flexible continuous vector space in which the original da…

    Submitted 21 March, 2013; v1 submitted 15 January, 2013; originally announced January 2013.

  34. arXiv:1107.3663  [pdf, other]

    cs.AI

    Towards Open-Text Semantic Parsing via Multi-Task Learning of Structured Embeddings

    Authors: Antoine Bordes, Xavier Glorot, Jason Weston, Yoshua Bengio

    Abstract: Open-text (or open-domain) semantic parsers are designed to interpret any statement in natural language by inferring a corresponding meaning representation (MR). Unfortunately, large scale systems cannot be easily machine-learned due to lack of directly supervised data. We propose here a method that learns to assign MRs to a wide range of text (using a dictionary of more than 70,000 words, which a…

    Submitted 19 July, 2011; originally announced July 2011.