Azure sentinal alert vs monitor alert - azure-monitoring

In Azure kusto query window We can create two alerts
Azure Monitor Alert
Azure Sentinel Alert
What are the difference between these.

These are two different things. In general Azure Monitor alert will be related to a performance metric type of alert maybe when your CPU on VM is at 75% and you want to be notified about it via e-mail/SMS/webhook etc. and Sentinel alert is being triggered when there has been some security alert in your environment for example Failed SSH brute force attack, the Sentinel alerts are being correlated into actionable security incidents to reduce noise.
Hope it helps :)

Related

HiveMQ/RabbitMQ as load balancing MQTT node(s) before Thingsboard IoT system

Our endpoint devices are pushing data over MQTT to an IoT system based on the Thingsboard IoT platform. There is only one MQTT topic called /telemetry where all devices connect. The server knows which device the data belongs to based on the device's token used as the MQTT username.
Due to not rare peaks of data loading, outages happen.
My question is:
Is it possible and how to use HiveMQ (RabbitMQ or some similar product) between devices and our IoT system to avoid data loss and smooth out peaks?
This post explains how to use Quality of Service levels, offline buffering, throttling, automatic reconnect and more to avoid data loss and maintain uptime.
The tldr; is that MQTT and HiveMQ have features built in to help avoid data loss, guaranteed delivery, traffic spikes and to handle back-pressure.
It may be worth considering what you can do with your existing tools before expanding your deploy footprint which just adds unnecessary complexity if unwarranted.
I would recommend using Apache Kafka or Confluent in between the MQTT Broker & ThingsBoard. Kafka stores all data on disk (instead of RAM in the case of RabbitMQ) and is scalable among multiple cluster nodes. You could also reload data to ThingsBoard by resetting offsets. This could be useful if there was an error in the configuration of a rulechain and you would have ThingsBoard reprocess the data again.
To connect with Kafka/Confluent you can use the ThingsBoard Integration.
Find more details here:
https://medium.com/python-point/mqtt-and-kafka-8e470eff606b

Network partition detected in RabbitMQ

I have a RabbitMQ cluster with 3 nodes. The system is live and frequently there is a network partition error.
Network partition detected
Mnesia reports that this RabbitMQ cluster has experienced a network
partition. There is a risk of losing data.
I want to receive an email notification when this event occurs in the RabbitMQ.
Is there a way to receive a notification from RabbitMQ if there is any network partition issue in the cluster?
you can configure prometheus, and with it, you can integrate mail, alerts etc..
there is also a video about that: https://www.youtube.com/watch?v=NWISW6AwpOE
frequently issue is not acceptable for rabbitmq.
so should avoid this situation.
in my case, if you using cluster with docker , met it frequently.
whilc change to normal installation. issue gone.

Redis connection settings for app "surviving" redis connectivity issues

I'm using azure redis cache for certain performance monitoring services. Basically when events like page loads, etc occur, I send a fire and forget command to redis to record the event. My goal is for my app to function fine whether or not it can contact the redis server. I'm looking for a best practice for this scenario. I would be OK with losing some events if necessary. I've been finding that even though I'm using fire and forget, the app staggers when the web server runs into high latency or connectivity issues with the server.
I'm using StackExchange.Redis. Any best practice configuration options/programming practices for this scenario?
The way I was implementing a singleton pattern on the connection turned out to be blocking requests. Once I fixed this my app behaves as I want (e.g. it still functions when redis connection dies).

what to check if automatic failover does not work on always on availability group?

on one of my environment automatic failover does not work. what things i need to check, please help me on this?
https://support.microsoft.com/en-us/help/2833707/troubleshooting-automatic-failover-problems-in-sql-server-2012-alwayson-environments
The symptoms when automatic failover is unsuccessful
If an automatic failover event is not successful, the secondary
replica does not successfully transition to the primary role.
Therefore, the availability replica will report that this replica is
in Resolving status. Additionally, the availability databases report
that they are in Not Synchronizing status, and applications cannot
access these databases.
For example, in the following image, SQL Server Management Studio
reports that the secondary replica is in Resolving status because the
automatic failover process was unable to transition the secondary
replica into the primary role:

Monitor apache connections

How do i monitor how many connections apache is getting? Something like http://www.cyberciti.biz/faq/apache-server-status/ . Basically i need some tool that will send an email when the number of connections exceed a specified limit. I am not able to find any which would give me the server side statistics of the live server. All that i got is related to simulate the real instance. Please help me if any.
mod_status makes the information available - you just need somethnig to poll the page and report.
Nagios provides a great platform for implementing monitoring (scheduling / alerting / reporting / escalation / automatic responses), while there are at least 2 plugins (check_apachestatus.pl and check_apache2.sh) which will report on concurrent connections.