KDD2020

Learning to Cluster Documents into Workspaces Using Large Scale Activity Logs

Weize Kong, Michael Bendersky, Marc Najork, Brandon Vargo, Mike Colagrosso

25 citations

Abstract

Google Drive is widely used for managing personal and work-related documents in the cloud. To help users organize their documents in Google Drive, we develop a new feature to allow users to create a set of working files for ongoing easy access, called workspace. A workspace is a cluster of documents, but unlike a typical document cluster, it contains documents that are not only topically coherent, but are also useful in the ongoing user tasks.