Universitat Politècnica de Catalunya

299 SlideShares 513 Followers 49 Followings

Xavier Giro-i-Nieto is an assistant professor at the Universitat Politècnica de Catalunya (UPC). He graduated in Electrical Engineering from ETSETB (UPC) in 2000, after completing his master's thesis on image compression at the Vrije Universiteit Brussel (VUB) under the direction of Professor Peter Schelkens. In 2001 he worked in the digital television group of Sony Brussels before returning to Barcelona and joining the Image Processing Group at the UPC. In 2003, he started teaching courses in Electrical Engineering degrees at the EET and ETSETB schools of the UPC. He obtained his PhD on image retrieval in 2012, under the supervision of Professor Ferran Marques from UPC and Profess...


Presentations

Crowdsourced Object Segmentation with a Game

Cristina Ruiz Sancho, "Tweet@TV, Social Television in 140 Characters"

Eva Mohedano, "Investigating EEG for Saliency and Segmentation Applications in Image Processing"

UPC at MediaEval Hyperlinking 2013

Part-based Object Retrieval with Binary Partition Trees

UPC at MediaEval Social Event Detection 2013

Automatic Keyframe Selection based on Mutual Reinforcement Algorithm

Web Interface for the Semi-Automatic Annotation of Semantic Shots

Reranking and Clustering of Images from a Video Search

Interactive Image Processing Demos for the Web

Co-advised Thesis in ETSETB mobility program

Video-on-Demand Service for the iPhone

Photo Clustering of Social Events by Extending PhotoTOC to a Rich Context

Rich Internet Application for Semi-Automatic Annotation of Semantic Shots on Keyframes

Extension of an Image Search Interface to Queries with Regions

Visual Search with Relevance Feedback Based on Weight Updating

Graphical User Interface for Image-Based Image Search

Content based video summarization into object maps

Bundling interest points for object classification

Low computational cost algorithms for photo clustering and mail signature detection in the cloud

Contextless Object Recognition with Shape-enriched SIFT and Bags of Features

Visual instance mining of news videos using a graph-based approach

Exploiting User Interaction and Object Candidates for Instance Retrieval and Object Segmentation

Object segmentation in images using EEG signals

UPC at MediaEval 2014 Social Event Detection Task

Pyxel, a Library for the Automatic Annotation of Photographs

Co-filtering human interaction and object segmentation

Visual Summary of Egocentric Photostreams by Representative Keyframes (BSc Ricard Mestre)

Relevance feedback for image retrieval with EEG signals

Rich Internet Application for Text and Image Queries at the Corporació Catalana de Mitjans Audiovisuals

Likes

The Impact of Segmentation on the Accuracy and Sensitivity of a Melanoma Classifier Based on Skin Lesion Images

Visual Information Retrieval: Advances, Challenges and Opportunities