ePrints@IIScePrints@IISc Home | About | Browse | Latest Additions | Advanced Search | Contact | Help

CADENCE: Offline Category Constrained and Diverse Query Generation for E-commerce Autosuggest

Anand, A and Kumar, S and Kumar, N and Shah, S (2023) CADENCE: Offline Category Constrained and Diverse Query Generation for E-commerce Autosuggest. In: 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, KDD 2023, 6-10 August 2023, Long Beach, pp. 3703-3712.

[img] PDF
ACM-SIGKDD_KDD2023_3703-3712_2023.pdf - Published Version
Restricted to Registered users only

Download (1MB) | Request a copy
Official URL: https://dl.acm.org/doi/10.1145/3580305.3599787

Abstract

Query AutoComplete (QAC) or AutoSuggest is the first place of user interaction with an e-commerce search engine. It is critical for the QAC system to suggest relevant and well-formed queries for multiple possible user intents. Suggesting only the historical user queries fails in the case of infrequent or new prefixes. Much of the recent works generate synthetic candidates using models trained on user queries and thus have these issues: a) cold start problem as new products in the catalogue fail to get visibility due to lack of representation in user queries b) poor quality of generated candidates due to concept drift and c) low diversity/coverage of attributes such as brand, color & other facets in generated candidates. In this paper, we propose an offline neural query generation framework - CADENCE - to address these challenges by a) using both user queries and noisy product titles to train two separate neural language models using self-attention memory networks, b) adding category constraints during the training and query generation process to prevent concept drift c) implementing customized dynamic beam search to generate more diverse candidates for a given prefix. Besides solving for cold start and rare/unseen prefix coverage, CADENCE also increases the coverage of the existing query prefixes through a higher number of relevant and diverse query suggestions. We generated ∼700K new offline queries, which have resulted in significant improvement in recall, reduction in product cold start, and increased coverage of attributes. Online A/B tests also show a significant impact on QAC usage, downstream search click-through rates, and product conversion. © 2023 ACM.

Item Type: Conference Paper
Publication: Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
Publisher: Association for Computing Machinery
Additional Information: The copyright of this article belongs to the Association for Computing Machinery.
Keywords: Computational linguistics; Search engines, Autocomplete; Beam search; Constrained dnn; Diverse beam search; E- commerces; E-commerce search; Generative autosuggest; Language model; Neural language model; User query, Electronic commerce
Department/Centre: Division of Physical & Mathematical Sciences > Physics
Date Deposited: 24 Nov 2023 10:17
Last Modified: 24 Nov 2023 10:17
URI: https://eprints.iisc.ac.in/id/eprint/83242

Actions (login required)

View Item View Item