Jian Wu

According to our database1, Jian Wu authored at least 39 papers between 2012 and 2020.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2020
Large Scale Subject Category Classification of Scholarly Papers with Deep Attentive Neural Networks.
CoRR, 2020

A Comparative Study of Sequence Tagging Methods for Domain Knowledge Entity Recognition in Biomedical Papers.
Proceedings of the JCDL '20: Proceedings of the ACM/IEEE Joint Conference on Digital Libraries in 2020, 2020

Keyphrase Extraction in Scholarly Digital Library Search Engines.
Proceedings of the Web Services - ICWS 2020, 2020

Accelerating Substructure Similarity Search for Formula Retrieval.
Proceedings of the Advances in Information Retrieval, 2020

COVIDSeer: Extending the CORD-19 Dataset.
Proceedings of the DocEng '20: ACM Symposium on Document Engineering 2020, Virtual Event, CA, USA, September 29, 2020

PSU at CLEF-2020 ARQMath Track: Unsupervised Re-ranking using Pretraining.
Proceedings of the Working Notes of CLEF 2020, 2020

2019
Query Auto Completion for Math Formula Search.
CoRR, 2019

Sec-Lib: Protecting Scholarly Digital Libraries From Infected Papers Using Active Machine Learning Framework.
IEEE Access, 2019

Automatic Slide Generation for Scientific Papers.
Proceedings of the Third International Workshop on Capturing Scientific Knowledge co-located with the 10th International Conference on Knowledge Capture (K-CAP 2019), 2019

Searching for Evidence of Scientific News in Scholarly Big Data.
Proceedings of the 10th International Conference on Knowledge Capture, 2019

Tangent-CFT: An Embedding Model for Mathematical Formulas.
Proceedings of the 2019 ACM SIGIR International Conference on Theory of Information Retrieval, 2019

Learned Neural Iterative Decoding for Lossy Image Compression Systems.
Proceedings of the Data Compression Conference, 2019

CiteSeerX: 20 years of service to scholarly big data.
Proceedings of the Conference on Artificial Intelligence for Data Discovery and Reuse, 2019

Cleaning Noisy and Heterogeneous Metadata for Record Linking across Scholarly Big Datasets.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Learned Iterative Decoding for Lossy Image Compression Systems.
CoRR, 2018

CiteSeerX-2018: A Cleansed Multidisciplinary Scholarly Big Dataset.
Proceedings of the IEEE International Conference on Big Data, 2018

2017
Scholarly Digital Libraries as a Platform for Malware Distribution.
Proceedings of the A Systems Approach to Cyber Security, 2017

A Supervised Learning Approach To Entity Matching Between Scholarly Big Datasets.
Proceedings of the Knowledge Capture Conference, 2017

HESDK: A Hybrid Approach to Extracting Scientific Domain Knowledge Entities.
Proceedings of the 2017 ACM/IEEE Joint Conference on Digital Libraries, 2017

Compiling Keyphrase Candidates for Scientific Literature Based on Wikipedia.
Proceedings of the Joint Proceedings of the 1st Workshop on Temporal Dynamics in Digital Libraries (TDDL 2017), 2017

2016
CiteSeerX data: semanticizing scholarly papers.
Proceedings of the International Workshop on Semantic Big Data, 2016

Information Extraction for Scholarly Digital Libraries.
Proceedings of the 16th ACM/IEEE-CS on Joint Conference on Digital Libraries, 2016

Document Type Classification in Online Digital Libraries.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015
CiteSeerX: AI in a Digital Library Search Engine.
AI Mag., 2015

Big Scholarly Data in CiteSeerX: Information Extraction from the Web.
Proceedings of the 24th International Conference on World Wide Web Companion, 2015

Online Learning of Deep Hybrid Architectures for Semi-supervised Categorization.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2015

PDFMEF: A Multi-Entity Knowledge Extraction Framework for Scholarly Documents and Semantic Search.
Proceedings of the 8th International Conference on Knowledge Capture, 2015

2014
Towards building a scholarly big data platform: Challenges, lessons and opportunities.
Proceedings of the IEEE/ACM Joint Conference on Digital Libraries, 2014

A Web Service for Scholarly Big Data Information Extraction.
Proceedings of the 2014 IEEE International Conference on Web Services, 2014

Scholarly big data information extraction and integration in the CiteSeer<sup>χ</sup> digital library.
Proceedings of the Workshops Proceedings of the 30th International Conference on Data Engineering Workshops, 2014

Migrating a Digital Library to a Private Cloud.
Proceedings of the 2014 IEEE International Conference on Cloud Engineering, 2014

Utility-Based Control Feedback in a Digital Library Search Engine: Cases in CiteSeerX.
Proceedings of the 9th International Workshop on Feedback Computing, 2014

CiteSeer x : A Scholarly Big Dataset.
Proceedings of the Advances in Information Retrieval, 2014

SimSeerX: a similar document search engine.
Proceedings of the ACM Symposium on Document Engineering 2014, 2014

The impact of user corrections on a crawl-based digital library: A CiteSeerX perspective.
Proceedings of the 10th IEEE International Conference on Collaborative Computing: Networking, 2014

CiteSeerX: AI in a Digital Library Search Engine.
Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014

2012
Specialized Research Datasets in the CiteSeer<sup>x</sup> Digital Library.
D Lib Mag., 2012

Web crawler middleware for search engine digital libraries: a case study for citeseerX.
Proceedings of the Twelfth International Workshop on Web Information and Data Management, 2012

The evolution of a crawling strategy for an academic document search engine: whitelists and blacklists.
Proceedings of the Web Science 2012, 2012


  Loading...