Questions tagged [deep-neural-networks]

For questions related to deep neural networks, which are artificial neural networks with "many" layers, where "many" can vary depending on the context.

110questions
Filter by
Sorted by
Tagged with
0
0答案
8views

分离LSTMs或一个全球性的相关要素集群

我有一个$ n $的维时间序列适用于LSTM,$ n $的是功能,每个时间点的数量。这些功能可以根据自己的理念进行集群,例如$ N_1,...,N_4 $是...
0
0答案
五views

Tensorboard不反映记录的数据[迁移]

I am new to using tensorboard and I wanted to get my own graph on the tensorboard, outside of training. I am working on Google Colabs and the code that I wrote is, ...
1
vote
1回答
33次

我们可以用一个解码器(GPT,变压器-XL)培训的编码器(BERT,XLM)预先建立一个聊天机器人,而不是语言翻译?

I was wondering if theBARTT5机型能够做到在英语句子生成的任务。必威电竞大部分车型我已经提到...
3
1回答
35的观点

什么是建立深厚的Q-网络的正确方法吗?

I'm new to RL and to deep q-learning and I have a simple question about the architecture of the neural network to use in an environment with a continous state space a discrete action space. I tought ...
0
0答案
67次

了解本DQN算法目标网络的作用

I've found online this interesting algorithm: From what I understand reading this algorithm, I can't figure out why I should "perform the opposite action" and consequently storing that second ...
1
vote
0答案
18views

How can I find the similar non-zero connections between different levels of sparsity of the same network?

我修剪神经网络(CNN和密集),并针对不同层次的稀疏性,我有不同的子网络。说为20%,40%,60%和80%水平的稀疏性,我有4个不同的子网络。现在,...
1
vote
1回答
22次

使用单个神经元或DNN随机值发生器

AI is supposed to do anything human or traditional computer can do, that is what we expect AI to be. So 'generating random value' is also a task included in the scope that AI should be able to do I'...
0
0答案
8views

如何稳定转化率连体神经网络的培训,如果培训的不同之后的结果而变化相对强烈?

我正在训练使用MSE和ADAM优化神经网络。更确切地说,一个连体建筑与卷积编码器和顶部欧氏距离。我使用MSE,因为我有不同的...
0
0答案
11views

深网带约束或辅助功能

我目前的神经网络的目标是预测一个标签。该数据集包含了一些功能,也就是在交易中的标签$ Y_I $ $ I $,表示它的分类。还有一个特点$ F ^ {I} ...
2
0答案
23views

Why Pixel RNN (Row LSTM) can capture triangular contexts?

I'm reading the paper Pixel Recurrent Neural Network. I have a question about Row LSTM. Why Row LSTM can capture triangular contexts? In this paper, the kernel of the one-dimensional convolution ...
3
0答案
38次

Are there any commonly used discontinuous activation functions?

Are there any commonly used activation functions (e.g. that take values in $(0,.5)\cup (.5,1)$)? Preferably for classification? Why? I was looking for commonly used activation functions on Google, ...
1
vote
1回答
25次

How to use convolution neural network in Deep-Q?

I currently have a grid of pixels 20x20. Each pixel can be red green blue or black. So I have one hot-encoded the pixels giving a 20x20x4 array for each screen. For my Deep-Q Network, I have ...
2
1回答
69次

How to detect vanishing gradients?

编辑:我返工我的问题来概括更好,更切合主题,并且大部分是软件实现无关。可以消失梯度在分配(或缺乏变化来检测...
1
vote
0答案
21views

步骤培训和再培训的良好模型

I'm still a bit new to deep learning. What I'm still struggling, is what is the best practice in re-training a good model over time? I've trained a deep model for my binary classification problem (...
1
vote
2答案
48次

Training accuracy vs validation accuracy on deep models

我训练了深刻的网络Kerason some images for a binary classification (I have around 12K images). Once in a while, I collect some false positives and add ...

1五 30 五0 per page
1
2 3 4
...
8