Tarique Siddiqui

PhD Student
Department of Computer Science
University of Illinois at Urbana Champaign(UIUC)
   SC 1121, 201 N Goodwin Avenue, Urbana, IL 61801

I am a PhD student in the Databases and Information Systems Research Group (DAIS) at UIUC, advised by Prof. Aditya Parameswaran.

My research interests lie at the intersection of database systems , data mining and human-computer interaction . I am currently building the next generation visual data analytics system for exploratory analysis of large datasets with focus on developing fast in-memory database systems, visual query language, integration of data mining techniques with database management systems and interactive visual interfaces. I also work on heterogeneous network based text mining problems for identifying and ranking interesting concepts in large scale datasets such as scientific corpora.

Previously, I have worked at Goldman Sachs on complex event processing, and large scale distributed system problems with emphasis on building online, self-learning and self-healing systems. I have a good knowledge of design, development, and deployment of multi-tier real world systems.

I received my B.Tech in Computer science from National Institute of Technology India in 2011. I am a recipient of Siebel Scholars Award, 2015 and Indian National Talent Search (NTS) Scholar award, 2005.
What's New
  • Oct 15, 2016: Our full paper on zenvisage has been accepted at VLDB 2017. Pre-camera ready version here.
  • Oct 11, 2016: Our demo paper on zenvisage has been accepted at CIDR 2017. Pre-camera ready version here.
  • July 17, 2016: Our full paper on FacetGist: Collective Extraction of Document Facets in Large Technical Corpora, has been accepted at CIKM 2017. Paper here.
  • May 13, 2016: We gave a talk on zenvisage at the multi-institution reading group on Visualization for Data Exploration and Analysis. Slides here.
  • May 1, 2016: Release of a new preprint on zenvisage. Paper here.
  • October 26, 2015, We presented our workshop paper on visualization recommendatiom systems at DSIA, VIS 2015 Chicago. Slides from Prof. Aditya Parameswaran.
  • September 2015, Our paper on "Towards Visualization Recommendation Systems" got accepted at the workshop on Data Systems for Interactive Analysis(DSIA), VIS 2015 Chicago
  • September 2015, Thrilled to be receiving the Siebel Scholars Award for the class of 2016. Announcements: 1, 2.
  • August 2014, Excited to be starting my MS under Prof. Aditya Parameswaran at the Department of Computer Science, UIUC
  • Effortless Data Exploration with zenvisage: An Expressive and Interactive Visual Analytics System  
    Tarique Siddiqui, Albert Kim, John Lee, Karrie Karahalios, Aditya Parameswaran
    VLDB 2017
  • Fast-Forwarding to Desired Visualizations with zenvisage  
    Tarique Siddiqui, John Lee, Albert Kim, Edward Xue, Xiaofo Yu, Sean Zou, Lijin Guo,Changfeng Liu, Chaoran Wang, Karrie Karahalios, Aditya Parameswaran
    CIDR 2017
  • FacetGist: Collective Extraction of Document Facets in Large Technical Corpora  
    Tarique Siddiqui, Xiang Ren, Aditya Parameswaran and Jiawei Han
    CIKM 2016
  • Towards Visualization Recommendation Systems  
    Manasi Vartak, Silu Huang, Tarique Siddiqui, Samuel Madden and Aditya Parameswaran.
    Workshop on Data Systems for Interactive Analysis(DSIA), VIS 2015 Chicago
Research Highlights
Current Projects


Zenvisage is a visual analytics system for ad-hoc, interactive, visual exploration of data. It enables users directly specify insights, i.e., trends, patterns, or anomalies of interest through a novel query language- ZQL and a set of interactive visual interfaces. The system efficiently identifies the right visualization that meets user specifications.

Collective Paper Profiling(CPP)

Given a collection of scientific documents, Paper Profiling is to automatically label each document with a set of concepts on different key aspects that people are interested in (e.g., application, technique, and dataset). We propose a novel graph-based framework for paper profiling. It considers not only local sentence-level features, such as string suffix and surrounding relation phrase, but also global context information, including document topics and section structures, to model the aspect of a concept mentioned in a document. This task has many interesting applications in scientific domain, including document summarization, literature search, patentability study and business intelligence.


Traditional database systems operate in an all-or-nothing manner, taking as long as it takes to return the entire set of results, however large the result set may be. However, analysts performing data exploration often browse, i.e., pose a query, examine a few of the resulting records and then repeatedly issue new queries. To this end, I am contributing to the development of an alternative database query interaction paradigm called browsing. The aim is to make efficient use of bitmap indices and design fast sampling algorithms for rapid retrieval of a small number of query result records.

Key Previous Projects

Fabric: A Complex Event Processing System

I contributed to the desing and development of Fabric: an agent based distributed complex event processing framework for alerting, trending and predictive analytics. discovering the dependencies among the firm’s applications and systems for risk, impact and root cause analysis by mining large scale heterogeneous datasets such as system metrics, application metrics, trading data etc.

3D Object Recognition and Targeting System [under-grad thesis]

I designed and developed a feature based 3D object recognition and targeting system using SURF descriptors, sum of squared differences (SSD) and disparity calculation technique to determine the pose and position of the object for precise targeting. Further, I developed a tagging game using wireless sensor networks, 3D object recognition techniques, xBee sensors and Arduino microcontrollers where a set of robots recognize and target each other. The game can be hosted at different game spots as diverse as malls, amusement parks or homes.

I assisted and helped in designing the following data management and system related courses at UIUC:
  • Human in Loop Data Mananagement
    Fall 2015 with Prof. Aditya Parameswaran
  • Advanced Data Management
    Spring 2015 with Prof. Aditya Parameswaran
  • Systems Programming
    Fall 2014 with Dr. Lawrence Angrave

When I have spare time, I enjoy arts, traveling, listening to urdu poetry and watching cricket.