Gerald Friedland

ORCID: 0000-0002-9400-6539

Affiliations:
  • Lawrence Livermore National Laboratory


According to our database, Gerald Friedland authored at least 174 papers between 2001 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Enhancing GAN-based Vocoders with Contrastive Learning Under Data-Limited Condition.
Proceedings of the IEEE International Conference on Acoustics, 2024

Information-Driven Machine Learning - Data Science as an Engineering Discipline, 4
Springer, ISBN: 978-3-031-39476-8, 2024

2023
Efficient Multimedia Computing: Unleashing the Power of AutoML.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Deep Layers Beware: Unraveling the Surprising Benefits of JPEG Compression for Image Classification Pre-processing.
Proceedings of the IEEE International Symposium on Multimedia, 2023

2022
A Systematic Review of Multimodal Approaches to Online Misinformation Detection.
Proceedings of the 5th IEEE International Conference on Multimedia Information Processing and Retrieval, 2022

2021
OrigamiSet1.0: Two New Datasets for Origami Classification and Difficulty Estimation.
CoRR, 2021

From Tinkering to Engineering: Measurements in Tensorflow Playground.
CoRR, 2021

Overview Paper for Driving Road Safety Forward: Video Data Privacy Task at MediaEval 2021.
Proceedings of the Working Notes Proceedings of the MediaEval 2021 Workshop, 2021

Detecting COVID-19 Conspiracy Theories with Transformers and TF-IDF.
Proceedings of the Working Notes Proceedings of the MediaEval 2021 Workshop, 2021

2020
DIME: An Online Tool for the Visual Comparison of Cross-modal Retrieval Models.
Proceedings of the MultiMedia Modeling - 26th International Conference, 2020

On the Impact of Perceptual Compression on Deep Learning.
Proceedings of the 3rd IEEE Conference on Multimedia Information Processing and Retrieval, 2020

Multi-Modal Ensemble Models for Predicting Video Memorability.
Proceedings of the Working Notes Proceedings of the MediaEval 2020 Workshop, 2020

Detecting Conspiracy Theories from Tweets: Textual and Structural Approaches.
Proceedings of the Working Notes Proceedings of the MediaEval 2020 Workshop, 2020

2019
Efficient Saliency Maps for Explainable AI.
CoRR, 2019

Reproducibility and Experimental Design for Machine Learning on Audio and Multimedia Data.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Supervised Deep Hashing for Highly Efficient Cover Song Detection.
Proceedings of the 2nd IEEE Conference on Multimedia Information Processing and Retrieval, 2019

From Intra-Modal to Inter-Modal Space: Multi-task Learning of Shared Representations for Cross-Modal Retrieval.
Proceedings of the Fifth IEEE International Conference on Multimedia Big Data, 2019

Privacy concerns of multimodal sensor systems.
Proceedings of the Handbook of Multimodal-Multisensor Interfaces: Language Processing, Software, Commercialization, and Emerging Directions, 2019

2018
Cybercasing 2.0: You Get What You Pay For.
CoRR, 2018

One Bit Matters: Understanding Adversarial Examples as the Abuse of Redundancy.
CoRR, 2018

A Practical Approach to Sizing Neural Networks.
CoRR, 2018

The Helmholtz Method: Using Perceptual Compression to Reduce Machine Learning Complexity.
CoRR, 2018

The Accuracy of the Demographic Inferences Shown on Google's Ad Settings.
Proceedings of the 2018 Workshop on Privacy in the Electronic Society, 2018

Rethinking Summarization and Storytelling for Modern Social Multimedia.
Proceedings of the MultiMedia Modeling - 24th International Conference, 2018

Audition for multimedia computing.
Proceedings of the Frontiers of Multimedia Research, 2018

2017
DCAR: A Discriminative and Compact Audio Representation for Audio Processing.
IEEE Trans. Multim., 2017

Contextual Noise Reduction for Domain Adaptive Near-Duplicate Retrieval on Merchandize Images.
IEEE Trans. Image Process., 2017

Field Studies with Multimedia Big Data: Opportunities and Challenges (Extended Ver.
CoRR, 2017

A Data-Centric View on Computational Complexity: P != NP.
CoRR, 2017

A Capacity Scaling Law for Artificial Neural Networks.
CoRR, 2017

An Isomorphism between Lyapunov Exponents and Shannon's Channel Capacity.
CoRR, 2017

Privacy Protection in Online Multimedia.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Privacy vs Multimedia Verification: A Conundrum.
Proceedings of the First International Workshop on Multimedia Verification, MuVer@MM 2017, 2017

The Geo-Privacy Bonus of Popular Photo Enhancements.
Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval, 2017

2016
DCAR: A Discriminative and Compact Audio Representation to Improve Event Detection.
CoRR, 2016

Where to be wary: The impact of widespread photo-taking and image enhancement practices on users' geo-privacy.
CoRR, 2016

YFCC100M: the new data in multimedia research.
Commun. ACM, 2016

The Teaching Privacy Curriculum.
Proceedings of the 47th ACM Technical Symposium on Computing Science Education, 2016

A Discriminative and Compact Audio Representation for Event Detection.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Multimedia Privacy.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

2015
Teaching Privacy: Multimedia Making a Difference.
IEEE Multim., 2015

The New Data and New Challenges in Multimedia Research.
CoRR, 2015

The YLI-MED Corpus: Characteristics, Procedures, and Plans.
CoRR, 2015

Teaching Privacy: What Every Student Needs to Know (Abstract Only).
Proceedings of the 46th ACM Technical Symposium on Computer Science Education, 2015

Insights into Audio-Based Multimedia Event Classification with Neural Networks.
Proceedings of the 2015 Workshop on Community-Organized Multimodal Mining: Opportunities for Novel Solutions, 2015

Multimedia COMMONS - Community-Organized Multimodal Mining: Opportunities for Novel Solutions (MMCommons Workshop 2015).
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Evento 360: Social Event Discovery from Web-scale Multimedia Collection.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Kickstarting the Commons: The YFCC100M and the YLI Corpora.
Proceedings of the 2015 Workshop on Community-Organized Multimodal Mining: Opportunities for Novel Solutions, 2015

Audio-Based Multimedia Event Detection with DNNs and Sparse Sampling.
Proceedings of the 5th ACM on International Conference on Multimedia Retrieval, 2015

An information-theoretic metric of fingerprint effectiveness.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Content-Based Privacy for Consumer-Produced Multimedia.
Proceedings of the Multimedia Data Mining and Analytics - Disruptive Innovation, 2015

Application of Large-Scale Classification Techniques for Simple Location Estimation Experiments.
Proceedings of the Multimodal Location Estimation of Videos and Images, 2015

The Benchmark as a Research Catalyst: Charting the Progress of Geo-prediction for Social Multimedia.
Proceedings of the Multimodal Location Estimation of Videos and Images, 2015

Collaborative Multimodal Location Estimation of Consumer Media.
Proceedings of the Multimodal Location Estimation of Videos and Images, 2015

Human Versus Machine: Establishing a Human Baseline for Multimodal Location Estimation.
Proceedings of the Multimodal Location Estimation of Videos and Images, 2015

Introduction.
Proceedings of the Multimodal Location Estimation of Videos and Images, 2015

2014
Scalable multimedia content analysis on parallel platforms using python.
ACM Trans. Multim. Comput. Commun. Appl., 2014

Creating Experts From the Crowd: Techniques for Finding Workers for Difficult Tasks.
IEEE Trans. Multim., 2014

Guest Editors' Introduction.
Int. J. Semantic Comput., 2014

The Placing Task: A Large-Scale Geo-Estimation Challenge for Social-Media Videos and Images.
Proceedings of the 3rd ACM Multimedia Workshop on Geotagging and Its Applications in Multimedia, 2014

GeoMM 2014: the third ACM multimedia workshop on geotagging and its applications in multimedia.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Toward efficient, privacy-aware media classification on public databases.
Proceedings of the International Conference on Multimedia Retrieval, 2014

Audio-concept features and hidden Markov models for multimedia event detection.
Proceedings of the 2nd International Workshop on Speech, Language and Audio in Multimedia, 2014

Audio concept classification with Hierarchical Deep Neural Networks.
Proceedings of the 22nd European Signal Processing Conference, 2014

Multimedia Computing.
Cambridge University Press, ISBN: 978-0-521-76451-3, 2014

2013
Editorial for automated media analysis and production for novel TV services.
Multim. Tools Appl., 2013

Narrative theme navigation for sitcoms supported by fan-generated scripts - Video navigation based on acoustic detection of actors and narrative elements.
Multim. Tools Appl., 2013

Exploiting innocuous activity for correlating users across sites.
Proceedings of the 22nd International World Wide Web Conference, 2013

SRI-Sarnoff AURORA System at TRECVID 2013 Multimedia Event Detection and Recounting.
Proceedings of the 2013 TREC Video Retrieval Evaluation, 2013

A novel fusion method for integrating multiple modalities and knowledge for multimodal location estimation.
Proceedings of the 2nd ACM international workshop on Geotagging and its applications in multimedia, 2013

Privacy concerns of sharing multimedia in social networks.
Proceedings of the ACM Multimedia Conference, 2013

Human vs machine: establishing a human baseline for multimodal location estimation.
Proceedings of the ACM Multimedia Conference, 2013

Second ACM multimedia workshop on geotagging and its applications in multimedia (GeoMM 2013).
Proceedings of the ACM Multimedia Conference, 2013

An i-Vector Representation of Acoustic Environments for Audio-Based Video Event Detection on User Generated Content.
Proceedings of the 2013 IEEE International Symposium on Multimedia, 2013

Exploring methods of improving speaker accuracy for speaker diarization.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Audio Concept Ranking for Video Event Detection on User-Generated Content.
Proceedings of the First Workshop on Speech, 2013

Lost in segmentation: Three approaches for speech/non-speech detection in consumer-produced videos.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo, 2013

Nowhere to hide: Exploring user-verification across Flickr accounts.
Proceedings of the IEEE International Conference on Acoustics, 2013

2012
Speaker Diarization: A Review of Recent Research.
IEEE Trans. Speech Audio Process., 2012

The ICSI RT-09 Speaker Diarization System.
IEEE Trans. Speech Audio Process., 2012

On the Applicability of Speaker Diarization to Audio Indexing of Non-Speech and Mixed Non-Speech/Speech Video Soundtracks.
Int. J. Multim. Data Eng. Manag., 2012

Industry Dares You: The ACM Multimedia Grand Challenge 2011.
IEEE Multim., 2012

SRI-Sarnoff AURORA System at TRECVID 2012 Multimedia Event Detection and Recounting.
Proceedings of the 2012 TREC Video Retrieval Evaluation, 2012

Name that room: room identification using acoustic features in a recording.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Pushing the limits of mechanical turk: qualifying the crowd for video geo-location.
Proceedings of the ACM multimedia 2012 workshop on Crowdsourcing for multimedia, 2012

AMVA'12: ACM international workshop on audio and multimedia methods for large-scale video analysis.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Privacy concerns in multimedia and their solutions.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

GeoMM'12: ACM international workshop on geotagging and its applications in multimedia.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Brave New Task: User Account Matching.
Proceedings of the Working Notes Proceedings of the MediaEval 2012 Workshop, 2012

The 2012 ICSI/Berkeley Video Location Estimation System.
Proceedings of the Working Notes Proceedings of the MediaEval 2012 Workshop, 2012

Where did I go wrong?: Identifying troublesome segments for speaker diarization systems.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Multimodal Location Estimation of Consumer Media: Dealing with Sparse Training Data.
Proceedings of the 2012 IEEE International Conference on Multimedia and Expo, 2012

Multimodal city-verification on flickr videos using acoustic and textual features.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

How to put it into words - using random forests to extract symbol level descriptions from audio content for concept detection.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011
Estimating Dominance in Multi-Party Meetings Using Speaker Diarization.
IEEE Trans. Speech Audio Process., 2011

Semantic Computing and Privacy: a Case Study Using Inferred Geo-Location.
Int. J. Semantic Comput., 2011


Data-Driven vs. Semantic-Technology-Driven Tag-Based Video Location Estimation.
Proceedings of the 5th IEEE International Conference on Semantic Computing (ICSC 2011), 2011

Sherlock holmes' evil twin: on the impact of global inference for online privacy.
Proceedings of the 2011 New Security Paradigms Workshop, 2011

Video2GPS: a demo of multimodal location estimation on flickr videos.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Acoustic and multimodal processing for multimedia content analysis.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

Automatic tagging and geotagging in video collections and communities.
Proceedings of the 1st International Conference on Multimedia Retrieval, 2011

On the Applicability of Speaker Diarization to Audio Concept Detection for Multimedia Retrieval.
Proceedings of the 2011 IEEE International Symposium on Multimedia, 2011

Improved Overlapped Speech Handling for Speaker Diarization.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

User verification: Matching the uploaders of videos across accounts.
Proceedings of the IEEE International Conference on Acoustics, 2011

CUDA-level Performance with Python-level Productivity for Gaussian Mixture Model Applications.
Proceedings of the 3rd USENIX Workshop on Hot Topics in Parallelism, 2011

Fast speaker diarization using a high-level scripting language.
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011

2010
Dialocalization: Acoustic speaker diarization and visual localization as joint optimization problem.
ACM Trans. Multim. Comput. Commun. Appl., 2010

Tuning-Robust Initialization Methods for Speaker Diarization.
IEEE Trans. Speech Audio Process., 2010

Guest Editors' Introduction.
Int. J. Semantic Comput., 2010

Cybercasing the Joint: On the Privacy Implications of Geo-Tagging.
Proceedings of the 5th USENIX Workshop on Hot Topics in Security, 2010

Multimodal Indoor Localization: An Audio-Wireless-Based Approach.
Proceedings of the 4th IEEE International Conference on Semantic Computing (ICSC 2010), 2010

3rd international workshop on automated information extraction in media production.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Precise indoor localization using smart phones.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Joke-o-Mat HD: browsing sitcoms with human derived transcripts.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Multimodal location estimation.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Narrative theme navigation for sitcoms supported by fan-generated scripts.
Proceedings of the 3rd International Workshop on Automated Information Extraction in Media Production, 2010

A parallel meeting diarist.
Proceedings of the 2010 International Workshop on Searching Spontaneous Conversational Speech, 2010

Parallelizing Speaker-Attributed Speech Recognition for Meeting Browsing.
Proceedings of the 12th IEEE International Symposium on Multimedia, 2010

Discriminative training for hierarchical clustering in speaker diarization.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

A hybrid approach to online speaker diarization.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Multimodal speaker diarization using oriented optical flow histograms.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

System output combination for improved speaker diarization.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Leveraging speaker diarization for meeting recognition from distant microphones.
Proceedings of the IEEE International Conference on Acoustics, 2010

An adaptive initialization method for speaker Diarization based on prosodic features.
Proceedings of the IEEE International Conference on Acoustics, 2010

2009
Prosodic and other Long-Term Features for Speaker Diarization.
IEEE Trans. Speech Audio Process., 2009

Guest Editors' Introduction.
Int. J. Semantic Comput., 2009

Can We Escape the Trough of Disillusionment?
eLearn Mag., 2009

On the Use of Artificial Conversation Data for Speaker Recognition in Cars.
Proceedings of the 3rd IEEE International Conference on Semantic Computing (ICSC 2009), 2009

Visual speaker localization aided by acoustic models.
Proceedings of the 17th International Conference on Multimedia 2009, 2009

Joke-o-mat: browsing sitcoms punchline by punchline.
Proceedings of the 17th International Conference on Multimedia 2009, 2009

Multimodal interfaces for automotive applications (MIAA).
Proceedings of the 14th International Conference on Intelligent User Interfaces, 2009

Using Artistic Markers and Speaker Identification for Narrative-Theme Navigation of Seinfeld Episodes.
Proceedings of the 11th IEEE International Symposium on Multimedia, 2009

Fusing short term and long term features for improved speaker diarization.
Proceedings of the IEEE International Conference on Acoustics, 2009

Multi-modal speaker diarization of real-world meetings using compressed-domain video features.
Proceedings of the IEEE International Conference on Acoustics, 2009

Robust Speaker Diarization for short speech recordings.
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

2008
Automated Lecture Recording.
Proceedings of the Encyclopedia of Multimedia, 2nd Ed., 2008

Guest Editors' Introduction.
Int. J. Semantic Comput., 2008

Special Section Introduction: Educational Multimedia.
IEEE Multim., 2008

Multimedia Education in Computer Science: A Little Bit of Everything Is Not Enough.
IEEE Multim., 2008

Anthropocentric Video Segmentation for Lecture Webcasts.
EURASIP J. Image Video Process., 2008

Towards Semantic Analysis of Conversations: A System for the Live Identification of Speakers in Meetings.
Proceedings of the 2nd IEEE International Conference on Semantic Computing (ICSC 2008), 2008

Appscio: A Software Environment for Semantic Multimedia Analysis.
Proceedings of the 2nd IEEE International Conference on Semantic Computing (ICSC 2008), 2008

Live speaker identification in conversations.
Proceedings of the 16th International Conference on Multimedia 2008, 2008

Multimedia education: can we find unity in diversity?
Proceedings of the 16th International Conference on Multimedia 2008, 2008

A Hardware-Independent Fast Logarithm Approximation with Adjustable Accuracy.
Proceedings of the Tenth IEEE International Symposium on Multimedia (ISM2008), 2008

Modulation spectrogram features for improved speaker diarization.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Two's a crowd: improving speaker diarization by automatically identifying and excluding overlapped speech.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Estimating the dominant person in multi-party conversations using speaker diarization strategies.
Proceedings of the IEEE International Conference on Acoustics, 2008

Overlapped speech detection for improved speaker diarization in multiparty meetings.
Proceedings of the IEEE International Conference on Acoustics, 2008

2007
Object Cut and Paste in Images and Videos.
Int. J. Semantic Comput., 2007

Ubiquitous Pointing and Drawing.
Int. J. Emerg. Technol. Learn., 2007

Reviews.
IEEE Ann. Hist. Comput., 2007

Reviews.
IEEE Ann. Hist. Comput., 2007

Current Multimedia Data Formats and Semantic Computing: A Practical Example and the Challenges for the Future.
Proceedings of the First IEEE International Conference on Semantic Computing (ICSC 2007), 2007

A low-cost mobile pointing and drawing device.
Proceedings of the International Workshop on Educational Multimedia and Multimedia Education 2007, 2007

Using audio and video features to classify the most dominant person in a group meeting.
Proceedings of the 15th International Conference on Multimedia 2007, 2007

The future of multimedia education and educational multimedia.
Proceedings of the International Workshop on Educational Multimedia and Multimedia Education 2007, 2007

Educational multimedia systems: the past, the present, and a glimpse into the future.
Proceedings of the International Workshop on Educational Multimedia and Multimedia Education 2007, 2007

A fast-match approach for robust, faster than real-time speaker diarization.
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007

2006
Adaptive audio and video processing for electronic chalkboard lectures.
PhD thesis, 2006

Human-Centered Webcasting of Interactive-Whiteboard Lectures.
Proceedings of the Eighth IEEE International Symposium on Multimedia (ISM 2006), 2006

A Practical Approach to Boundary Accurate Multi-Object Extraction from Still Images and Videos.
Proceedings of the Eighth IEEE International Symposium on Multimedia (ISM 2006), 2006

2005
Architecting Multimedia Environments for Teaching.
Computer, 2005

The Virtual Technician: An Automatic Software Enhancer for Audio Recording in Lecture Halls.
Proceedings of the Knowledge-Based Intelligent Information and Engineering Systems, 2005

SIOX: Simple Interactive Object Extraction in Still Images.
Proceedings of the Seventh IEEE International Symposium on Multimedia (ISM 2005), 2005

Towards a Demand Driven, Autonomous Processing and Streaming Architecture.
Proceedings of the 12th IEEE International Conference on the Engineering of Computer-Based Systems (ECBS 2005), 2005

An Interactive Datawall for an Intelligent Classroom.
Proceedings of the Workshop Proceedings DeLFI 2005 und GMW05, 2005

2004
E-Chalk: a lecture recording system using the chalkboard metaphor.
Interact. Technol. Smart Educ., 2004

Web based lectures produced by ai supported classroom teaching.
Int. J. Artif. Intell. Tools, 2004

Teaching with an intelligent electronic chalkboard.
Proceedings of the 2004 ACM SIGMM Workshop on Effective Telepresence, 2004

2003
Web Based Education as a Result of AI Supported Classroom Teaching.
Proceedings of the Knowledge-Based Intelligent Information and Engineering Systems, 2003

Conserving an Ancient Art of Music: Making SID Tunes Editable.
Proceedings of the Computer Music Modeling and Retrieval, International Symposium, 2003

2001
Elektronische Kreide: Eine Java-Multimedia-Tafel fuer den Präsenz- und Fernunterricht.
Inform. Forsch. Entwickl., 2001

