I just had about 10 minutes of extremely slow response times - upwards of 5 minutes, though the app didn’t technically crash. During this time, there’s actually a gap in the memory usage in the logs.
I’m curious what could have caused this. I’m using a micro instance on the free tier and I am probably pretty close to maxing that out in terms of memory. I had also run an operation to generate and store a few MB worth of vector embeddings (though the actual inference happens on HF). I was also in the console running du -sh . just to see how big things were.
Are there often fluctuations and outages like this? Would the solution just be to upgrade to a larger instance? Thank you.
I think I may have hit my memory limits from doing various operations. I just deployed a change that uses disk storage instead of memory storage and it cut my usage drastically. (I’m using lancedb instead of docarray. The lancedb data is tiny and in git, so I assume it’s not ephemeral.)
The metrics were originally showing that I was using about 90% of my memory at rest. Now it’s more like 50%. Hopefully that prevents further interruptions, but we’ll see.
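To illustrate why moving vectors from memory to disk helps, here is a minimal sketch using only the Python standard library. It is not lancedb's API or how lancedb works internally. It just shows the general idea of keeping embeddings in a file and reading them one row at a time. The names DIM, append_vectors, and nearest are hypothetical, and the tiny vector size is only for illustration.

```python
import array

DIM = 4  # hypothetical embedding size; real embeddings are much larger


def append_vectors(path, vectors):
    """Append float32 vectors to a flat binary file on disk."""
    with open(path, "ab") as f:
        for vec in vectors:
            if len(vec) != DIM:
                raise ValueError("unexpected vector length")
            array.array("f", vec).tofile(f)


def nearest(path, query):
    """Return the index of the stored vector closest to query (squared L2).

    Reads one vector at a time, so only a single row is held in memory.
    Returns -1 if the file contains no vectors.
    """
    best_index, best_dist = -1, float("inf")
    with open(path, "rb") as f:
        index = 0
        while True:
            row = array.array("f")
            try:
                row.fromfile(f, DIM)
            except EOFError:
                break  # reached the end of the file
            dist = sum((a - b) ** 2 for a, b in zip(row, query))
            if dist < best_dist:
                best_index, best_dist = index, dist
            index += 1
    return best_index
```

A linear scan like this trades speed for memory. A real vector store adds indexing on top, but the memory saving comes from the same place: the data lives in a file and not in the process.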
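For anyone who wants to check this from inside the app instead of relying on the dashboard, here is a rough sketch that reports the process's peak memory against a limit. It assumes a Unix-like host (the resource module is not available on Windows). The limit_mb value is a placeholder you would need to fill in for your own plan, and peak_rss_mb and usage_fraction are hypothetical helper names.

```python
import resource
import sys


def peak_rss_mb():
    """Peak resident set size of this process, in megabytes."""
    peak = resource.getrusage(resource.RUSAGE_SELF).ru_maxrss
    # ru_maxrss is reported in kilobytes on Linux and in bytes on macOS.
    divisor = 1024 * 1024 if sys.platform == "darwin" else 1024
    return peak / divisor


def usage_fraction(limit_mb):
    """Fraction of the given memory limit used at this process's peak."""
    return peak_rss_mb() / limit_mb


if __name__ == "__main__":
    limit_mb = 512  # placeholder; use the limit for your own instance
    print(f"peak RSS: {peak_rss_mb():.1f} MB "
          f"({usage_fraction(limit_mb):.0%} of {limit_mb} MB)")
```

Note that this only covers the one Python process and reports the peak, not the current value, so it may not match the platform's metrics exactly.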
Thanks Yann. Luckily I did find a better solution for vector storage and retrieval and it cut my memory usage way down! This is just a prototype so I’m really trying to stay on the free tier.