Ankit Agrawal

Orcid: 0000-0002-5519-0302

Affiliations:
  • Northwestern University, Evanston, IL, USA
  • Iowa State University, Ames, IA, USA (PhD 2009)


According to our database1, Ankit Agrawal authored at least 148 papers between 2008 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Automated Nanoparticle Image Processing Pipeline for AI-Driven Materials Characterization.
Proceedings of the 33rd ACM International Conference on Information and Knowledge Management, 2024

2023
MPpredictor: An Artificial Intelligence-Driven Web Tool for Composition-Based Material Property Prediction.
J. Chem. Inf. Model., April, 2023

A deep learning framework for layer-wise porosity prediction in metal powder bed fusion using thermal signatures.
J. Intell. Manuf., 2023

I/O in WRF: A Case Study in Modern Parallel I/O Techniques.
Proceedings of the International Conference for High Performance Computing, 2023

AI for Learning Deformation Behavior of a Material: Predicting Stress-Strain Curves 4000x Faster Than Simulations.
Proceedings of the International Joint Conference on Neural Networks, 2023

Pre-Activation based Representation Learning to Enhance Predictive Analytics on Small Materials Data.
Proceedings of the International Joint Conference on Neural Networks, 2023

A Deep Learning Framework for Time-Series Processing-Microstructure-Property Prediction.
Proceedings of the International Conference on Machine Learning and Applications, 2023

Physics-based Data-Augmented Deep Learning for Enhanced Autogenous Shrinkage Prediction on Experimental Dataset.
Proceedings of the 2023 Fifteenth International Conference on Contemporary Computing, 2023

A Case Study of Data Management Challenges Presented in Large-Scale Machine Learning Workflows.
Proceedings of the 23rd IEEE/ACM International Symposium on Cluster, 2023

2022
A case study on parallel HDF5 dataset concatenation for high energy physics data analysis.
Parallel Comput., 2022

Improving scalability of parallel CNN training by adaptively adjusting parameter update frequency.
J. Parallel Distributed Comput., 2022

An Incremental Phase Mapping Approach for X-ray Diffraction Patterns using Binary Peak Representations.
CoRR, 2022

A Case Study on Parallel HDF5 Dataset Concatenation for High Energy Physics Data Analysis.
CoRR, 2022

BRNet: Branched Residual Network for Fast and Accurate Predictive Modeling of Materials Properties.
Proceedings of the 2022 SIAM International Conference on Data Mining, 2022

Machine Learning for Materials Science (MLMS).
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

To Shuffle or Not To Shuffle: Mini-Batch Shuffling Strategies for Multi-class Imbalanced Classification.
Proceedings of the International Conference on Computational Science and Computational Intelligence, 2022

Which Deep Learning Framework Should I Use: A Comparative Study For Deep Regression Modeling.
Proceedings of the International Conference on Computational Science and Computational Intelligence, 2022

Using Multi-Resolution Data to Accelerate Neural Network Training in Scientific Applications.
Proceedings of the 22nd IEEE International Symposium on Cluster, 2022

2021
Harnessing Psycho-lingual and Crowd-Sourced Dictionaries for Predicting Taboos in Written Emotional Disclosure in Anonymous Confession Boards.
J. Heal. Informatics Res., 2021

A General Framework Combining Generative Adversarial Networks and Mixture Density Networks for Inverse Modeling in Microstructural Materials Design.
CoRR, 2021

Enhancing Phase Mapping for High-throughput X-ray Diffraction Experiments using Fuzzy Clustering.
Proceedings of the 10th International Conference on Pattern Recognition Applications and Methods, 2021

SIGRNN: Synthetic Minority Instances Generation in Imbalanced Datasets using a Recurrent Neural Network.
Proceedings of the 10th International Conference on Pattern Recognition Applications and Methods, 2021

Asynchronous I/O Strategy for Large-Scale Deep Learning Applications.
Proceedings of the 28th IEEE International Conference on High Performance Computing, 2021

Supporting Data Compression in PnetCDF.
Proceedings of the 2021 IEEE International Conference on Big Data (Big Data), 2021

2020
Improving MPI Collective I/O for High Volume Non-Contiguous Requests With Intra-Node Aggregation.
IEEE Trans. Parallel Distributed Syst., 2020

Improving all-to-many personalized communication in two-phase I/O.
Proceedings of the International Conference for High Performance Computing, 2020

Heterogeneous Feature Fusion Based Machine Learning on Shallow-Wide and Heterogeneous-Sparse Industrial Datasets.
Proceedings of the Pattern Recognition. ICPR International Workshops and Challenges, 2020

Predicting Resource Requirement in Intermediate Palomar Transient Factory Workflow.
Proceedings of the 20th IEEE/ACM International Symposium on Cluster, 2020

Communication-Efficient Local Stochastic Gradient Descent for Scalable Deep Learning.
Proceedings of the 2020 IEEE International Conference on Big Data (IEEE BigData 2020), 2020

2019
Scalable Algorithms for MPI Intergroup Allgather and Allgatherv.
Parallel Comput., 2019

Survivability prediction of colon cancer patients using neural networks.
Health Informatics J., 2019

Improving MPI Collective I/O Performance With Intra-node Request Aggregation.
CoRR, 2019

IRNet: A General Purpose Deep Residual Regression Framework for Materials Discovery.
Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019

Deep learning based domain knowledge integration for small datasets: Illustrative applications in materials informatics.
Proceedings of the International Joint Conference on Neural Networks, 2019

Transfer Learning Using Ensemble Neural Networks for Organic Solar Cell Screening.
Proceedings of the International Joint Conference on Neural Networks, 2019

Peak Area Detection Network for Directly Learning Phase Regions from Raw X-ray Diffraction Patterns.
Proceedings of the International Joint Conference on Neural Networks, 2019

Data-Driven Insights from Predictive Analytics on Heterogeneous Experimental Data of Industrial Magnetic Materials.
Proceedings of the 2019 International Conference on Data Mining Workshops, 2019

A Real-Time Iterative Machine Learning Approach for Temperature Profile Prediction in Additive Manufacturing Processes.
Proceedings of the 2019 IEEE International Conference on Data Science and Advanced Analytics, 2019

Martensite Start Temperature Predictor for Steels Using Ensemble Data Mining.
Proceedings of the 2019 IEEE International Conference on Data Science and Advanced Analytics, 2019

Improving Scalability of Parallel CNN Training by Adjusting Mini-Batch Size at Run-Time.
Proceedings of the 2019 IEEE International Conference on Big Data (IEEE BigData), 2019

Spatiotemporal Real-Time Anomaly Detection for Supercomputing Systems.
Proceedings of the 2019 IEEE International Conference on Big Data (IEEE BigData), 2019

2018
Prediction of seebeck coefficient for compounds without restriction to fixed stoichiometry: A machine learning approach.
J. Comput. Chem., 2018

CheMixNet: Mixed DNN Architectures for Predicting Chemical Properties using Multiple Molecular Representations.
CoRR, 2018

Integration of Burst Buffer in High-level Parallel I/O Library for Exa-scale Computing Era.
Proceedings of the 3rd IEEE/ACM International Workshop on Parallel Data Storage & Data Intensive Scalable Computing Systems, 2018

Full-Duplex Inter-Group All-to-All Broadcast Algorithms with Optimal Bandwidth.
Proceedings of the 25th European MPI Users' Group Meeting, 2018

Towards Autonomic Science Infrastructure: Architecture, Limitations, and Open Issues.
Proceedings of the 1st International Workshop on Autonomous Infrastructure for Science, 2018

Parallel DBSCAN Algorithm Using a Data Partitioning Strategy with Spark Implementation.
Proceedings of the IEEE International Conference on Big Data (IEEE BigData 2018), 2018

2017
Legislative prediction with dual uncertainty minimization from heterogeneous information.
Stat. Anal. Data Min., 2017

SILVERBACK+: scalable association mining via fast list intersection for columnar social data.
Knowl. Inf. Syst., 2017

Medical Concept Normalization for Online User-Generated Texts.
Proceedings of the 2017 IEEE International Conference on Healthcare Informatics, 2017

Forecasting Influenza Levels Using Real-Time Social Media Streams.
Proceedings of the 2017 IEEE International Conference on Healthcare Informatics, 2017

Towards Identifying Informal Caregivers of Alzheimer's and Dementia Patients in Social Media.
Proceedings of the 2017 IEEE International Conference on Healthcare Informatics, 2017

A Hybrid Training Algorithm for Recurrent Neural Network Using Particle Swarm Optimization-Based Preprocessing and Temporal Error Aggregation.
Proceedings of the 2017 IEEE International Conference on Data Mining Workshops, 2017

Analyzing Informal Caregiving Expression in Social Media.
Proceedings of the 2017 IEEE International Conference on Data Mining Workshops, 2017

Building Halo Merger Trees from the Q Continuum Simulation.
Proceedings of the 24th IEEE International Conference on High Performance Computing, 2017

Parallel Deep Convolutional Neural Network Training by Exploiting the Overlapping of Computation and Communication.
Proceedings of the 24th IEEE International Conference on High Performance Computing, 2017

Distinguish Polarity in Bag-of-Words Visualization.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Parallel Community Detection Algorithm Using a Data Partitioning Strategy with Pairwise Subdomain Duplication.
Proceedings of the High Performance Computing - 31st International Conference, 2016

A Novel Scalable DBSCAN Algorithm with Spark.
Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium Workshops, 2016

Towards Informal Caregiver Identification in Social Media.
Proceedings of the 2016 IEEE International Conference on Healthcare Informatics, 2016

Predicting the Outcome of Startups: Less Failure, More Success.
Proceedings of the IEEE International Conference on Data Mining Workshops, 2016

A Formation Energy Predictor for Crystalline Materials Using Ensemble Data Mining.
Proceedings of the IEEE International Conference on Data Mining Workshops, 2016

Five Year Life Expectancy Calculator for Older Adults.
Proceedings of the IEEE International Conference on Data Mining Workshops, 2016

Identifying hotspots in five year survival electronic health records of older adults.
Proceedings of the 6th IEEE International Conference on Computational Advances in Bio and Medical Sciences, 2016

AGORAS: A Fast Algorithm for Estimating Medoids in Large Datasets.
Proceedings of the International Conference on Computational Science 2016, 2016

Parallel Implementation of Lossy Data Compression for Temporal Data Sets.
Proceedings of the 23rd IEEE International Conference on High Performance Computing, 2016

Parallel DTFE Surface Density Field Reconstruction.
Proceedings of the 2016 IEEE International Conference on Cluster Computing, 2016

A Filtering-based Clustering Algorithm for Improving Spatio-temporal Kriging Interpolation Accuracy.
Proceedings of the 25th ACM International Conference on Information and Knowledge Management, 2016

A Fatigue Strength Predictor for Steels Using Ensemble Data Mining: Steel Fatigue Strength Predictor.
Proceedings of the 25th ACM International Conference on Information and Knowledge Management, 2016

PinterNet: A thematic label curation tool for large image datasets.
Proceedings of the 2016 IEEE International Conference on Big Data (IEEE BigData 2016), 2016

Materials discovery: Understanding polycrystals from large-scale electron patterns.
Proceedings of the 2016 IEEE International Conference on Big Data (IEEE BigData 2016), 2016

Evaluation of K-means data clustering algorithm on Intel Xeon Phi.
Proceedings of the 2016 IEEE International Conference on Big Data (IEEE BigData 2016), 2016

2015
Qualitative Decision Methods for Multi-Attribute Decision Making.
CoRR, 2015

Incremental, distributed single-linkage hierarchical clustering algorithm using mapreduce.
Proceedings of the Symposium on High Performance Computing, 2015

All your Google and Facebook logins are belong to us: A case for single sign-off.
Proceedings of the Eighth International Conference on Contemporary Computing, 2015

Pruned search: A machine learning based meta-heuristic approach for constrained continuous optimization.
Proceedings of the Eighth International Conference on Contemporary Computing, 2015

Running MAP Inference on Million Node Graphical Models: A High Performance Computing Perspective.
Proceedings of the 15th IEEE/ACM International Symposium on Cluster, 2015

A Scalable Hierarchical Clustering Algorithm Using Spark.
Proceedings of the First IEEE International Conference on Big Data Computing Service and Applications, 2015

Mining Social Media Streams to Improve Public Health Allergy Surveillance.
Proceedings of the 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, 2015

Reducing infrequent-token perplexity via variational corpora.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

2014
Data Compression for the Exascale Computing Era - Survey.
Supercomput. Front. Innov., 2014

Excavating social circles via user interests.
Soc. Netw. Anal. Min., 2014

MuSES: Multilingual Sentiment Elicitation System for Social Media Data.
IEEE Intell. Syst., 2014

NUMARCK: Machine Learning Algorithm for Resiliency and Checkpointing.
Proceedings of the International Conference for High Performance Computing, 2014

Enhancing Financial Decision-Making Using Social Behavior Modeling.
Proceedings of the 8th Workshop on Social Network Mining and Analysis, 2014

A New Parallel Algorithm for Two-Pass Connected Component Labeling.
Proceedings of the 2014 IEEE International Parallel & Distributed Processing Symposium Workshops, 2014

Estimating Online User Location Distribution without GPS Location.
Proceedings of the 2014 IEEE International Conference on Data Mining Workshops, 2014

Social Role Identification via Dual Uncertainty Minimization Regularization.
Proceedings of the 2014 IEEE International Conference on Data Mining, 2014

SILVERBACK: Scalable association mining for temporal data in columnar probabilistic databases.
Proceedings of the IEEE 30th International Conference on Data Engineering, Chicago, 2014

Clique guided community detection.
Proceedings of the 2014 IEEE International Conference on Big Data (IEEE BigData 2014), 2014

Indexing bipartite memberships in web graphs.
Proceedings of the 2014 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, 2014

2013
Graphical Modeling of Macro Behavioral Targeting in Social Networks.
Proceedings of the 13th SIAM International Conference on Data Mining, 2013

Scalable parallel OPTICS data clustering using graph algorithmic techniques.
Proceedings of the International Conference for High Performance Computing, 2013

A scalable algorithm for single-linkage hierarchical clustering on distributed-memory architectures.
Proceedings of the IEEE Symposium on Large-Scale Data Analysis and Visualization, 2013

Real-time disease surveillance using Twitter data: demonstration on flu and cancer.
Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2013

Solving combinatorial optimization problems using relaxed linear programming: a high performance computing perspective.
Proceedings of the 2nd International Workshop on Big Data, 2013

JobMiner: a real-time system for mining job-related patterns from social media.
Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2013

Forecast Oriented Classification of Spatio-Temporal Extreme Events.
Proceedings of the IJCAI 2013, 2013

Random walk-based graphical sampling in unbalanced heterogeneous bipartite social graphs.
Proceedings of the 22nd ACM International Conference on Information and Knowledge Management, 2013

Mining diabetes complication and treatment patterns for clinical decision support.
Proceedings of the 22nd ACM International Conference on Information and Knowledge Management, 2013

Bootstrapping active name disambiguation with crowdsourcing.
Proceedings of the 22nd ACM International Conference on Information and Knowledge Management, 2013

Feedback-driven multiclass active learning for data streams.
Proceedings of the 22nd ACM International Conference on Information and Knowledge Management, 2013

Elver: Recommending Facebook pages in cold start situation without content features.
Proceedings of the 2013 IEEE International Conference on Big Data (IEEE BigData 2013), 2013

Colon cancer survival prediction using ensemble data mining on SEER data.
Proceedings of the 2013 IEEE International Conference on Big Data (IEEE BigData 2013), 2013

Lung transplant outcome prediction using UNOS data.
Proceedings of the 2013 IEEE International Conference on Big Data (IEEE BigData 2013), 2013

A probabilistic graphical model for brand reputation assessment in social networks.
Proceedings of the Advances in Social Networks Analysis and Mining 2013, 2013

2012
Lung cancer survival prediction using ensemble data mining on SEER data.
Sci. Program., 2012

Accelerating pairwise statistical significance estimation for local alignment by harvesting GPU's power.
BMC Bioinform., 2012

Sentiment identification by incorporating syntax, semantics and context information.
Proceedings of the 35th International ACM SIGIR conference on research and development in Information Retrieval, 2012

A new scalable parallel DBSCAN algorithm using the disjoint-set data structure.
Proceedings of the SC Conference on High Performance Computing Networking, 2012

VOXSUP: a social engagement framework.
Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2012

CluChunk: clustering large scale user-generated content incorporating chunklet information.
Proceedings of the 1st International Workshop on Big Data, 2012

High Performance Big Data Clustering.
Proceedings of the Transition of HPC Towards Exascale Computing, 2012

Parallel hierarchical clustering on shared memory platforms.
Proceedings of the 19th International Conference on High Performance Computing, 2012

Probabilistic macro behavioral targeting.
Proceedings of the 2012 workshop on Data-driven User Behavioral Modelling and Mining from Social Media, 2012

On active learning in hierarchical classification.
Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 2012

2011
Pairwise Statistical Significance of Local Sequence Alignment Using Sequence-Specific and Position-Specific Substitution Matrices.
IEEE ACM Trans. Comput. Biol. Bioinform., 2011

Association Rule Mining Based HotSpot Analysis on SEER Lung Cancer Data.
Int. J. Knowl. Discov. Bioinform., 2011

Parallel pairwise statistical significance estimation of local sequence alignment using Message Passing Interface library.
Concurr. Comput. Pract. Exp., 2011

Anatomy of a hash-based long read sequence mapping algorithm for next generation DNA sequencing.
Bioinform., 2011

A lung cancer outcome calculator using ensemble data mining on SEER data.
Proceedings of the Tenth International Workshop on Data Mining in Bioinformatics, 2011

Predicting Node Failure in High Performance Computing Systems from Failure and Usage Logs.
Proceedings of the 25th IEEE International Symposium on Parallel and Distributed Processing, 2011

SES: Sentiment Elicitation System for Social Media Data.
Proceedings of the Data Mining Workshops (ICDMW), 2011

Poll: A Citation Text Based System for Identifying High-Impact Contributions of an Article.
Proceedings of the Data Mining Workshops (ICDMW), 2011

Twitter Trending Topic Classification.
Proceedings of the Data Mining Workshops (ICDMW), 2011

Community Dynamics and Analysis of Decadal Trends in Climate Data.
Proceedings of the Data Mining Workshops (ICDMW), 2011

Learning to Group Web Text Incorporating Prior Information.
Proceedings of the Data Mining Workshops (ICDMW), 2011

Identifying HotSpots in Lung Cancer Data Using Association Rule Mining.
Proceedings of the Data Mining Workshops (ICDMW), 2011

Efficient pairwise statistical significance estimation for local sequence alignment using GPU.
Proceedings of the IEEE 1st International Conference on Computational Advances in Bio and Medical Sciences, 2011

Poster: A lung cancer mortality risk calculator based on SEER data.
Proceedings of the IEEE 1st International Conference on Computational Advances in Bio and Medical Sciences, 2011

Poster: Risk prediction for post-operative adverse outcomes in colorectal cancer surgery.
Proceedings of the IEEE 1st International Conference on Computational Advances in Bio and Medical Sciences, 2011

2010
Fuzzy-adaptive-thresholding-based exon prediction.
Int. J. Comput. Biol. Drug Des., 2010

MPIPairwiseStatSig: parallel pairwise statistical significance estimation of local sequence alignment.
Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing, 2010

Efficient Pairwise Statistical Significance Estimation using FPGAs.
Proceedings of the International Conference on Bioinformatics & Computational Biology, 2010

Non-Conservative Pairwise Statistical Significance of Local Sequence Alignment Using Position-Specific Substitution Matrices.
Proceedings of the International Conference on Bioinformatics & Computational Biology, 2010

Derived distribution points heuristic for fast pairwise statistical significance estimation.
Proceedings of the First ACM International Conference on Bioinformatics and Computational Biology, 2010

2009
Pairwise statistical significance of local sequence alignment using multiple parameter sets and empirical justification of parameter set change penalty.
BMC Bioinform., 2009

PSIBLAST_PairwiseStatSig: reordering PSI-BLAST hits using pairwise statistical significance.
Bioinform., 2009

2008
Pairwise statistical significance and empirical determination of effective gap opening penalties for protein local sequence alignment.
Int. J. Comput. Biol. Drug Des., 2008

Estimating Pairwise Statistical Significance of Protein Local Alignments Using a Clustering-Classification Approach Based on Amino Acid Composition.
Proceedings of the Bioinformatics Research and Applications, 2008

Pairwise Statistical Significance Versus Database Statistical Significance for Local Alignment of Protein Sequences.
Proceedings of the Bioinformatics Research and Applications, 2008

An adaptive fuzzy thresholding algorithm for exon prediction.
Proceedings of the 2008 IEEE International Conference on Electro/Information Technology, 2008

A new heuristic for multiple sequence alignment.
Proceedings of the 2008 IEEE International Conference on Electro/Information Technology, 2008

DNAlignTT: Pairwise DNA alignment with sequence specific transition-transversion ratio.
Proceedings of the 2008 IEEE International Conference on Electro/Information Technology, 2008

Pairwise Statistical Significance of Local Sequence Alignment Using Substitution Matrices with Sequence-Pair-Specific Distance.
Proceedings of the 2008 International Conference on Information Technology, 2008

Pairwise DNA Alignment with Sequence Specific Transition-Transversion Ratio Using Multiple Parameter Sets.
Proceedings of the 2008 International Conference on Information Technology, 2008

Pairwise statistical significance of local sequence alignment using multiple parameter sets.
Proceedings of the Proceeding of the 2nd International Workshop on Data and Text Mining in Bioinformatics, 2008

Conservative, Non-conservative and Average Pairwise Statistical Significance of Local Sequence Alignment.
Proceedings of the 2008 IEEE International Conference on Bioinformatics and Biomedicine, 2008


  Loading...