[D] Random occasional spikes in validation loss

By skyforbes Nov 27, 2025 No Comments

https://preview.redd.it/a9a5cmud890g1.png?width=320&format=png&auto=webp&s=4d3b35fe360f74ce16de394f4cce37ac00ca6acf

Hello everyone, I am training a captcha recognition model using CRNN. The problem now is that there are occasional spikes in my validation loss, which I'm not sure why it occurs. Below is my model architecture at the moment. Furthermore, loss seems to remain stuck around 4-5 mark and not decrease, any idea why? TIA!

input_image = layers.Input(shape=(IMAGE_WITH, IMAGE_HEIGHT, 1), name="image", dtype=tf.float32)
input_label = layers.Input(shape=(None, ), dtype=tf.float32, name="label")

x = layers.Conv2(32, (3,3), activation="relu", padding="same", kernel_initializer="he_normal")(input_image)
x = layers.MaxPooling2(pool_size=(2,2))(x) 

x = layers.Conv2(64, (3,3), activation="relu", padding="same", kernel_initializer="he_normal")(x)
x = layers.MaxPooling2(pool_size=(2,2))(x) 

x = layers.Conv2(128, (3,3), activation="relu", padding="same", kernel_initializer="he_normal")(x)
x = layers.BatchNormalization()(x)
x = layers.MaxPooling2(pool_size=(2,1))(x)

reshaped = layers.Reshape(target_shape=(50, 6*128))(x)
x = layers.ense(64, activation="relu", kernel_initializer="he_normal")(reshaped)

rnn_1 = layers.Bidirectional(layers.LSTM(128, return_sequences=True, dropout=0.25))(x)
embedding = layers.Bidirectional(layers.LSTM(64, return_sequences=True, dropout=0.25))(rnn_1)

output_preds = layers.ense(units=len(char_to_num.get_vocabulary())+1, activation='softmax', name="Output")(embedding )

Output = CTCLayer(name="CTCLoss")(input_label, output_preds)

By skyforbes

MachineLearning

[D] Moral Uncertainty Around Emerging AI Introspection

skyforbes Nov 27, 2025

MachineLearning

[D][P] PKBoost v2 is out! An entropy-guided boosting library with a focus on drift adaptation and multiclass/regression support.

skyforbes Nov 27, 2025

MachineLearning

[D] What would change in your ML workflow if Jupyter or VS Code opened in seconds on a cloud-hosted OS?

skyforbes Nov 27, 2025

[D] Random occasional spikes in validation loss

Like this:

By skyforbes

Leave a ReplyCancel reply

You Missed

Is this normal? Lol

✍️ 9 ChatGPT Prompts That Instantly Improve Your Writing (Copy + Paste)

AI Now Builds the Whole Damn Thing

Archives

[D] Random occasional spikes in validation loss

Like this:

By skyforbes

Related Posts

[D] Moral Uncertainty Around Emerging AI Introspection

[D][P] PKBoost v2 is out! An entropy-guided boosting library with a focus on drift adaptation and multiclass/regression support.

[D] What would change in your ML workflow if Jupyter or VS Code opened in seconds on a cloud-hosted OS?

Leave a ReplyCancel reply

You Missed

Is this normal? Lol

✍️ 9 ChatGPT Prompts That Instantly Improve Your Writing (Copy + Paste)

AI Now Builds the Whole Damn Thing