# pliers
A Python 3 package for automated feature extraction.
## Status
* [![Build Status](https://travis-ci.org/tyarkoni/pliers.svg?branch=master)](https://travis-ci.org/tyarkoni/pliers)
* [![Coverage Status](https://coveralls.io/repos/github/tyarkoni/pliers/badge.svg?branch=master)](https://coveralls.io/github/tyarkoni/pliers?branch=master)
## Overview
Pliers is a Python package for automated extraction of features from multimodal stimuli. It provides a unified, standardized interface to dozens of different feature extraction tools and services--including many state-of-the-art deep learning-based APIs. It's designed to let you rapidly and flexibly extract all kinds of useful information from videos, images, audio, and text.
You might benefit from pliers if you need to accomplish any of the following tasks (and many others!):
* Identify objects or faces in a series of images
* Transcribe the speech in an audio or video file
* Apply sentiment analysis to text
* Extract musical features from an audio clip
* Apply a part-of-speech tagger to a block of text
Each of the above tasks can typically be accomplished in 2-3 lines of code with pliers. Combining them *all*--and returning a single, standardized, integrated DataFrame as the result--might take a bit more work. Say, maybe 5 or 6 lines.
In a nutshell, pliers provides an extremely high-level, unified interface to a very large number of feature extraction tools that span a wide range of modalities.
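To make that concrete, here's a minimal sketch of a combined workflow. It assumes the optional `face_recognition` dependency is installed and uses two of pliers' face extractors; swap in whatever extractors suit your stimuli.

```python
from os.path import join
from pliers.extractors import (FaceRecognitionFaceLocationsExtractor,
                               FaceRecognitionFaceLandmarksExtractor,
                               merge_results)
from pliers.tests.utils import get_test_data_path

# Run two different extractors over the same image, then merge the
# separate results into a single standardized DataFrame.
image = join(get_test_data_path(), 'image', 'obama.jpg')
extractors = [FaceRecognitionFaceLocationsExtractor(),
              FaceRecognitionFaceLandmarksExtractor()]
results = [ext.transform(image) for ext in extractors]
df = merge_results(results)
```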
## How to cite
If you use pliers in your work, please cite both the pliers GitHub repository (http://github.com/tyarkoni/pliers) and the following paper:
> McNamara, Q., De La Vega, A., & Yarkoni, T. (2017, August). [Developing a comprehensive framework for multimodal feature extraction](https://dl.acm.org/citation.cfm?id=3098075). In Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (pp. 1567-1574). ACM.
## Documentation
The official pliers documentation is quite thorough, and contains a comprehensive [quickstart](http://tyarkoni.github.io/pliers/quickstart.html) doc (also available below), [user guide](http://tyarkoni.github.io/pliers/) and complete [API Reference](http://tyarkoni.github.io/pliers/reference.html).
## Installation
For the latest release:
> pip install pliers
Or, if you want to work on the bleeding edge:
> pip install git+https://github.com/tyarkoni/pliers.git
### Dependencies
By default, installing pliers with pip will only install third-party libraries that are essential for pliers to function properly. These libraries are listed in `requirements.txt`. However, because pliers provides interfaces to a large number of feature extraction tools, there are literally dozens of other optional dependencies that may be required depending on what kinds of features you plan to extract (see `optional-dependencies.txt`). To be on the safe side, you can install all of the optional dependencies with pip:
> pip install -r optional-dependencies.txt
Note, however, that some of these Python dependencies have their own (possibly platform-dependent) requirements. For example, `python-magic` requires `libmagic` (see the `python-magic` documentation for installation instructions), and without it you'll be relegated to loading all your stims explicitly rather than passing in filenames (i.e., `stim = VideoStim('my_video.mp4')` will work fine, but passing `'my_video.mp4'` directly to an `Extractor` may not). Additionally, the Python OpenCV bindings require OpenCV3--but relatively few of the feature extractors in pliers currently depend on OpenCV, so you may not need to bother with this. Similarly, the `TesseractConverter` requires the tesseract OCR library, but no other `Transformer` does, so unless you're planning to capture text from images, you're probably safe.
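As a minimal sketch of the difference (the file paths here are placeholders):

```python
from pliers.stimuli import ImageStim, VideoStim

# Explicitly constructing a stim always works, because the class itself
# tells pliers what kind of input it's dealing with...
video = VideoStim('my_video.mp4')
image = ImageStim('my_image.jpg')

# ...whereas passing a raw filename to a Transformer requires python-magic
# (and libmagic) to infer the stim type from the file.
```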
### API Keys
While installing pliers itself is usually straightforward, setting up some of the web-based feature extraction APIs that pliers interfaces with can take a bit more effort. For example, pliers includes support for face and object recognition via Google’s Cloud Vision API, and enables conversion of audio files to text transcripts via several different speech-to-text services. While some of these APIs are free to use (and virtually all provide a limited number of free monthly calls), they all require each user to register for their own API credentials. This means that, in order to get the most out of pliers, you’ll probably need to spend some time registering accounts on a number of different websites. More details on API key setup are available [here](http://tyarkoni.github.io/pliers/installation.html#api-keys).
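As one concrete example, Google's client libraries (which pliers relies on for its Google extractors) read service-account credentials from a standard environment variable, so a typical setup step looks like this (the credentials path is a placeholder, and other services use their own variable names--see the linked docs):

```python
import os

# Point Google's client libraries at your service-account credentials.
os.environ['GOOGLE_APPLICATION_CREDENTIALS'] = '/path/to/credentials.json'

from pliers.extractors import GoogleVisionAPIFaceExtractor
ext = GoogleVisionAPIFaceExtractor()
```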
## Quickstart
A detailed user guide can be found in the [pliers documentation](http://tyarkoni.github.io/pliers/); below we provide a few brief examples illustrating the flexibility and utility of the package. An executable Jupyter Notebook containing all of the examples can be found [here](https://github.com/tyarkoni/pliers/blob/master/examples/Quickstart.ipynb).
### Face detection
This first example uses the `face_recognition` package's location extraction method to detect the location of Barack Obama's face within a single image. The tools used to do this are completely local (i.e., the image isn't sent to an external API).
We output the result as a pandas DataFrame; the `face_locations` column contains the coordinates of the bounding box in CSS format (i.e., top, right, bottom, and left edges).
```python
from os.path import join
from pliers.extractors import FaceRecognitionFaceLocationsExtractor
from pliers.tests.utils import get_test_data_path

# A picture of Barack Obama, bundled with pliers' test data
image = join(get_test_data_path(), 'image', 'obama.jpg')
# Initialize Extractor
ext = FaceRecognitionFaceLocationsExtractor()
# Apply Extractor to image
result = ext.transform(image)
result.to_df()
```
<div>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>onset</th>
<th>order</th>
<th>duration</th>
<th>object_id</th>
<th>face_locations</th>
</tr>
</thead>
<tbody>
<tr>
<th>0</th>
<td>NaN</td>
<td>NaN</td>
<td>NaN</td>
<td>0</td>
<td>(142, 349, 409, 82)</td>
</tr>
</tbody>
</table>
</div>
### Face detection with multiple inputs
What if we want to run the face detector on multiple images? Naively, we could just loop over input images and apply the `Extractor` to each one. But pliers makes this even easier by natively accepting iterables as inputs. The following code is almost identical to the snippet above. The only notable difference is that the result we get back is now also a list (because the features extracted from each image are stored separately), so we need to explicitly combine the results using the `merge_results` utility.
```python
from os.path import join
from pliers.extractors import (FaceRecognitionFaceLocationsExtractor,
                               merge_results)
from pliers.tests.utils import get_test_data_path

# Three images from pliers' test data; only some of them contain faces
images = ['apple.jpg', 'obama.jpg', 'thai_people.jpg']
images = [join(get_test_data_path(), 'image', img) for img in images]
ext = FaceRecognitionFaceLocationsExtractor()
results = ext.transform(images)
df = merge_results(results)
df
```
<div>
<table border="1" class="dataframe">
<thead>
<tr style="text-align: right;">
<th></th>
<th>source_file</th>
<th>onset</th>
<th>class</th>
<th>filename</th>
<th>stim_name</th>
<th>history</th>
<th>duration</th>
<th>order</th>
<th>object_id</th>
<th>FaceRecognitionFaceLocationsExtractor#face_locations</th>
</tr>
</thead>
<tbody>
<tr>
<th>0</th>
<td>/Users/tal/Dropbox/Code/pliers/pliers/tests/da...</td>
<td>NaN</td>
<td>ImageStim</td>
<td>/Users/tal/Dropbox/Code/pliers/pliers/tests/da...</td>
<td>obama.jpg</td>
<td></td>
<td>NaN</td>
<td>NaN</td>
<td>0</td>
<td>(142, 349, 409, 82)</td>
</tr>
<tr>
<th>1</th>
<td>/Users/tal/Dropbox/Code/pliers/pliers/tests/da...</td>
<td>NaN</td>