Skip to main content

Dataset / Protein Embedding Analysis

Have a question about this item?

Item information. View source record on contributor's website.

Title
Protein Embedding Analysis
Date Created and/or Issued
2020-01-01 to 2020-06-05
Contributing Institution
UC San Diego, Research Data Curation Program
Collection
Data Science & Engineering Master of Advanced Study (DSE MAS) Capstone Projects
Rights Information
Under copyright
Constraint(s) on Use: This work is protected by the U.S. Copyright Law (Title 17, U.S.C.). Use of this work beyond that allowed by "fair use" or any license applied to this work requires written permission of the copyright holder(s). Responsibility for obtaining permissions and any use and distribution of this work rests exclusively with the user and not the UC San Diego Library. Inquiries can be made to the UC San Diego Library program having custody of the work.
Use: This work is available from the UC San Diego Library. This digital copy of the work is intended to support research, teaching, and private study.
Rights Holder and Contact
Dharma, Arjun; Dedhia, Rahil; Waldschmidt, Thomas B.
Description
Using deep learning transformer model BERT to study protein sequences and prediction tasks such as sub-cellular location, fluorescence, and secondary structure
Research Data Curation Program, UC San Diego, La Jolla, 92093-0175 (https://lib.ucsd.edu/rdcp)
Tasks Assessing Protein Embeddings (TAPE) use license @ https://github.com/songlab-cal/tape/blob/master/LICENSE
Dharma, Arjun; Dedhia, Rahil; Waldschmidt, Thomas B.; Rose, Peter (2020). Protein Embedding Analysis. In Data Science & Engineering Master of Advanced Study (DSE MAS) Capstone Projects. UC San Diego Library Digital Collections. https://doi.org/10.6075/J0KS6Q2H
Type
dataset
Identifier
ark:/20775/bb0892752c
Language
English
Subject
Neural networks (Computer science)
Support Vector Machine (SVM)
Jupyter Notebooks
Protein analysis
Natural Language Processing (NLP)
SciKit Learn
Tensorflow
Pytorch
Protein embedding
XGBoost
Data Science & Engineering Master of Advanced Study (DSE MAS)
Language embedding
Keras
Machine learning
Graphics Processing Unit (GPU)
Logistic regression
Bidirectional Encoder Representations from Transformers (BERT)
Deep learning
Transformers
Task: Regression
DSE MAS - 2020 Cohort

About the collections in Calisphere

Learn more about the collections in Calisphere. View our statement on digital primary resources.

Copyright, permissions, and use

If you're wondering about permissions and what you can do with this item, a good starting point is the "rights information" on this page. See our terms of use for more tips.

Share your story

Has Calisphere helped you advance your research, complete a project, or find something meaningful? We'd love to hear about it; please send us a message.

Explore related content on Calisphere: