Computational Statistics in Data Science. Группа авторов

Чтение книги онлайн.

Читать онлайн книгу Computational Statistics in Data Science - Группа авторов страница 45

Computational Statistics in Data Science - Группа авторов

Скачать книгу

      Since the original LSTM model was introduced, many variants have been proposed. Forget gate was introduced in Gers et al. [20]. It has been proven effective and is standard in most LSTM architectures. The forwarding process of LSTM with a forget gate can be divided into two steps. In the first step, the following values are calculated:

      (12)StartLayout 1st Row 1st Column bold-italic z Superscript left-parenthesis t right-parenthesis 2nd Column equals hyperbolic tangent left-parenthesis bold-italic upper W Subscript 1 z Baseline bold-italic x Superscript left-parenthesis t right-parenthesis Baseline plus bold-italic upper W Subscript 2 z Baseline bold-italic h Superscript left-parenthesis t minus 1 right-parenthesis Baseline plus bold-italic b Subscript z Baseline right-parenthesis 2nd Row 1st Column bold-italic i Superscript left-parenthesis t right-parenthesis 2nd Column equals sigma Subscript g Baseline left-parenthesis bold-italic upper W Subscript 1 i Baseline bold-italic x Superscript left-parenthesis t right-parenthesis Baseline plus bold-italic upper W Subscript 2 i Baseline bold-italic h Superscript left-parenthesis t minus 1 right-parenthesis Baseline plus bold-italic b Subscript i Baseline right-parenthesis 3rd Row 1st Column bold-italic f Superscript left-parenthesis t right-parenthesis 2nd Column equals sigma Subscript g Baseline left-parenthesis bold-italic upper W Subscript 1 f Baseline bold-italic x Superscript left-parenthesis t right-parenthesis Baseline plus bold-italic upper W Subscript 2 f Baseline bold-italic h Superscript left-parenthesis t minus 1 right-parenthesis Baseline plus bold-italic b Subscript f Baseline right-parenthesis 4th Row 1st Column bold-italic o Superscript left-parenthesis t right-parenthesis 2nd Column equals sigma Subscript g Baseline left-parenthesis bold-italic upper W Subscript 1 o Baseline bold-italic x Superscript left-parenthesis t right-parenthesis Baseline plus bold-italic upper W Subscript 2 o Baseline bold-italic h Superscript left-parenthesis t minus 1 right-parenthesis Baseline plus bold-italic b Subscript o Baseline right-parenthesis EndLayout

      where upper W and b are weight matrix and bias, and sigma Subscript g Baseline left-parenthesis z right-parenthesis equals StartFraction 1 Over 1 plus exp left-parenthesis z right-parenthesis EndFraction is the sigmoid function.

      The two hidden states bold-italic h Superscript left-parenthesis t right-parenthesis and bold-italic c Superscript left-parenthesis t right-parenthesis are calculated by

      (14)StartLayout 1st Row 1st Column bold-italic h Superscript left-parenthesis t right-parenthesis 2nd Column equals bold-italic o Superscript left-parenthesis t right-parenthesis Baseline ring hyperbolic tangent left-parenthesis bold-italic c Superscript left-parenthesis t right-parenthesis Baseline right-parenthesis EndLayout

stat08316fgz009

      (15)StartLayout 1st Row 1st Column StartFraction delta script l Superscript left-parenthesis upper T right-parenthesis Baseline Over delta bold-italic upper W Subscript 1 f Baseline EndFraction 2nd Column equals sigma-summation Underscript t equals 0 Overscript upper T Endscripts StartFraction delta script l Superscript left-parenthesis upper T right-parenthesis Baseline Over delta bold-italic h Superscript left-parenthesis upper T right-parenthesis Baseline EndFraction StartFraction delta bold-italic h Superscript left-parenthesis upper T right-parenthesis Baseline Over delta bold-italic c Superscript left-parenthesis upper T right-parenthesis Baseline EndFraction left-parenthesis product Underscript j equals t plus 1 Overscript upper T Endscripts StartFraction delta bold-italic c Superscript left-parenthesis j right-parenthesis Baseline Over delta bold-italic c Superscript left-parenthesis j minus 1 right-parenthesis Baseline EndFraction right-parenthesis StartFraction delta bold-italic c Superscript left-parenthesis t right-parenthesis Baseline Over delta bold-italic upper W 
						<noindex><p style= Скачать книгу