Diagnostics and Monitoring

  1. Add the ability to monitor total RAM usage on a VM

    We have a graph that monitors CPU usage, Network traffic, and disk read/write, but it would be very nice to have a graph to show RAM usage on a VM over a period of time (much like the CPU). Especially when deciding to switch between say an A2 and an A5.

    27 votes
    Vote
    Sign in
    (thinking…)
    Sign in with: Microsoft
    Signed in as (Sign out)
    You have left! (?) (thinking…)
    under review  ·  1 comment  ·  Flag idea as inappropriate…  ·  Admin →
  2. Show deployment markers on all trend charts in portal and App Insights

    To make it easier to spot trends and causes, I love to see all charts in the Azure Portal also feature deployments as vertical lines so I can see when any statistic changed due to a deployment.

    Etsy does this for their internal builds and I thought it was an excellent idea and was surprised Azure didn't do this since it could easily do so. See attached for a slide from one of their presentations. The vertical lines represent deployments.

    4 votes
    Vote
    Sign in
    (thinking…)
    Sign in with: Microsoft
    Signed in as (Sign out)
    You have left! (?) (thinking…)
    0 comments  ·  Flag idea as inappropriate…  ·  Admin →
  3. Show custom counters in the Monitoring graph and allow alerts based of custom counters

    I've created a custom counter that I can see using the "Azure Management Studio" tool, but it doesn't show up in the Monitoring graph.
    I would love to be able to set an alert that says when a custom counter is below a certain level send out an alert, but it doesn't show my custom counters in the alert configuration wizard

    6 votes
    Vote
    Sign in
    (thinking…)
    Sign in with: Microsoft
    Signed in as (Sign out)
    You have left! (?) (thinking…)
    0 comments  ·  Flag idea as inappropriate…  ·  Admin →
  4. Reduce the autoscaling and metric collection window (linux and windows)

    Currently, the metrics are collected every 15 minutes when scaling based on CPU. We can reduce the scaling frequency down to 15 minutes causing scaling operations to occur up to 30 minutes after the threshold is met. I'd like to see this time decreased significantly on Linux and windows VM's. In live hosting environments, we should be able to scale in minutes if not seconds based on cpu threshold.

    Thanks!
    -Elliot

    2 votes
    Vote
    Sign in
    (thinking…)
    Sign in with: Microsoft
    Signed in as (Sign out)
    You have left! (?) (thinking…)
    0 comments  ·  Flag idea as inappropriate…  ·  Admin →
  5. Support charts with data for many resources in the application

    We have different metrics for cloud services. CPU/Memory etc.
    But they display only per cloud server.

    I have an application that consist from many clod services: frontend, backend, cache, translation, gis etc.

    I want to see some merged charts. For example, I want to see all services CPU consumption telemetry on one chart to see if all my services are handle the load. I don't want to go to each service panel to see theirs CPU load

    Thank you.

    3 votes
    Vote
    Sign in
    (thinking…)
    Sign in with: Microsoft
    Signed in as (Sign out)
    You have left! (?) (thinking…)
    under review  ·  0 comments  ·  Flag idea as inappropriate…  ·  Admin →
  6. Azure opeartion logs : Hosted service deletion log should have deployment details

    Azure operation logs under Management Services in management portal should have additional information for Hosted service deletion log. It should have the details of deployments deleted along with the hosted service deletion.

    The scenario goes like this : I got a notification as given below from Azure team. Notification mentions some of my deployment Ids which need attention. However, one of the deployment ids in the notification was not present in any of the subscriptions I had. I checked the operation logs and could find hosted service deletion. The log had details of Hosted service and subscription but not the…

    4 votes
    Vote
    Sign in
    (thinking…)
    Sign in with: Microsoft
    Signed in as (Sign out)
    You have left! (?) (thinking…)
    0 comments  ·  Flag idea as inappropriate…  ·  Admin →
  7. Increase DBs "auto-export" feature limitation

    When configuring auto-export for an SQL DB on Azure portal, it works great.

    The only major lack is that we are limited to 10 auto-export schedules, what means that if I have 11 DBs or more that I'd like to backup - I can't use this feature and that's a shame..

    As a premium customer who pays for any bandwidth, I cannot understand why I'm so limited.

    Thank you.

    2 votes
    Vote
    Sign in
    (thinking…)
    Sign in with: Microsoft
    Signed in as (Sign out)
    You have left! (?) (thinking…)
    0 comments  ·  Flag idea as inappropriate…  ·  Admin →
  8. Allow application diagnostics to be enabled for table and blog storage

    In the old portal you could enable application diagnostics to be output to file system, table and blob. In the new portal I can only enable file system. The table and blob options are missing?

    1 vote
    Vote
    Sign in
    (thinking…)
    Sign in with: Microsoft
    Signed in as (Sign out)
    You have left! (?) (thinking…)
    0 comments  ·  Flag idea as inappropriate…  ·  Admin →
  9. 1 vote
    Vote
    Sign in
    (thinking…)
    Sign in with: Microsoft
    Signed in as (Sign out)
    You have left! (?) (thinking…)
    0 comments  ·  Flag idea as inappropriate…  ·  Admin →
  10. Improve the Auto Scaling based on queue feature


    1. currently we can only specify a queue and auto scaling based on the queue size regardless of the message type in the queue, for instance, we have deadletter messages and scheduled messages, but it is only the active messages that needs to be considered for auto-scaling, this impacts the cost significantly.


    2. auto-scaling's up scaling only supports fixed numbers, can we support relative numbers? for instance, calculate based on the numbers of messages in the queue


    2 votes
    Vote
    Sign in
    (thinking…)
    Sign in with: Microsoft
    Signed in as (Sign out)
    You have left! (?) (thinking…)
    0 comments  ·  Flag idea as inappropriate…  ·  Admin →
  11. AZURE Admin Login Notifications by email or SMS.

    I have been plagued by unbelievable security issues with my ISP MediaTemple for more than 2yrs now. After contacting more than 10 ISP's over a 1yr period it seems that there is no functionality in the marketplace to allow the primary ADMIN of a ISP cloud/webhosting account, in this case the admin user of a Azure, to be notified by SMS or email when someone logs into the admin account to view/modify settings.

    An example. Someone sniffed my packets and passwords. Logged into my admin account. Changed DNS and Zone file records and put a MX redirect so that all…

    7 votes
    Vote
    Sign in
    (thinking…)
    Sign in with: Microsoft
    Signed in as (Sign out)
    You have left! (?) (thinking…)
    0 comments  ·  Flag idea as inappropriate…  ·  Admin →
  12. Prevent the immediate deallocation of scalled virtual machines when manually starting

    I have a series of Virtual Machines in a cloud instance which switch on or off based on CPU usage.

    Sometimes I want to manually start all virtual machines to apply some configuration changes.

    Unfortunately the auto scaling causes the machines to be immediately switch back off. I have tried setting the "Scale down wait time" to 60 minutes but this setting is ignored if you manually start the machines.

    The only work around at the moment:

    1) Turn off scaling.
    2) Start all virtual machines and apply configurations
    3) Re-configure scaling settings.

    I suggest you should honor the "Scale…

    1 vote
    Vote
    Sign in
    (thinking…)
    Sign in with: Microsoft
    Signed in as (Sign out)
    You have left! (?) (thinking…)
    0 comments  ·  Flag idea as inappropriate…  ·  Admin →

    So to be clear, you’re suggesting that the cooldown period should be based on any scaling action, not just an autoscaling action? This seems like a reasonable change.

  13. Raise the Alert Monitor Limit per Azure Subscription

    Currently, there is a ten alert limit for cloud services per subscription in Azure. This is functionally useless. We do four deployments per app for geo-redundancy, and if we want to monitor three different things on all our boxes, we're already over the limit. And we have dozens of apps.

    Also, if you do increase the alert count, considering enabling an option to change monitoring level to verbose using the API, so we can automate the creation of monitors for memory usage.

    8 votes
    Vote
    Sign in
    (thinking…)
    Sign in with: Microsoft
    Signed in as (Sign out)
    You have left! (?) (thinking…)
    0 comments  ·  Flag idea as inappropriate…  ·  Admin →
  14. Incorporate old portals management services Alert rules into new portal

    It appears that I can't access the alert rules I defined in the old portal within the new preview portal... are there plans to surface these rules in the new portal?

    3 votes
    Vote
    Sign in
    (thinking…)
    Sign in with: Microsoft
    Signed in as (Sign out)
    You have left! (?) (thinking…)
    0 comments  ·  Flag idea as inappropriate…  ·  Admin →
    started  ·  Stephen Siciliano responded

    This will be addressed soon — any alerts on Virtual Machines or Websites will show up in the new portal.

  15. Audit and IT fundamental services

    Provide unalterable audit logs for services like ACS.

    Provide management stats and backup/recovery mechanisms and services for all provided services.

    12 votes
    Vote
    Sign in
    (thinking…)
    Sign in with: Microsoft
    Signed in as (Sign out)
    You have left! (?) (thinking…)
    0 comments  ·  Flag idea as inappropriate…  ·  Admin →

    Thanks for the feedback. We’ll look into this. Each service has their own auditing capabilities, but we can consider standardization, if that would be helpful for you. Let us know if you have any specifics you’re looking for or ways this could help you.

  16. Enable a "before or after" event to be defined in auto-scaling of Cloud Services

    We domain-join the Virtual Machines associated with our Cloud Service roles. Of course, the process of joining the domain forces a reboot. And, when scaling up, this is fine. However, when scaling-down we want to automatically remove the machines AD Account from the domain. I don't see a way to execute a "before or after" event in Azure auto-scaling. The current prescription is to override the roles OnStop event. However, there is no way to tell if it is a simple reboot or an actual de-allocation.

    We should be able to explicitly define & execute a separate set of code…

    1 vote
    Vote
    Sign in
    (thinking…)
    Sign in with: Microsoft
    Signed in as (Sign out)
    You have left! (?) (thinking…)
    1 comment  ·  Flag idea as inappropriate…  ·  Admin →

    Is this code that you’d want to define inside your role, or, code that you’d want to define in the autoscale setting?

  17. The (auto) scaling graph has a really wierd timebase. The x-Axis needs to autoscale!

    Like the title says, when starting with autoscaling, the graph looks awful and overwrites itsself if there are several scaling actions. The x-axis needs to autoscale and be configurable (time range). Furthermore, the legend needs to be explained, how does cpu percentage fit into the y-axis of the instance count?

    3 votes
    Vote
    Sign in
    (thinking…)
    Sign in with: Microsoft
    Signed in as (Sign out)
    You have left! (?) (thinking…)
    0 comments  ·  Flag idea as inappropriate…  ·  Admin →
  18. Performance Query Widgets on Dashboard get reset on new load

    The Query-Settings for the Dashboard widgets partly reset on reopening the dashboard. Some have extra plots added, others have less, and even others have a different time range set than before. Exactly which setting is reset is always the same on each widget.

    1 vote
    Vote
    Sign in
    (thinking…)
    Sign in with: Microsoft
    Signed in as (Sign out)
    You have left! (?) (thinking…)
    0 comments  ·  Flag idea as inappropriate…  ·  Admin →
  19. Add an Alert Rule so that notification can occur as soon as a web service Web Services DIAGNOSTICS CONNECTION STRINGS response fails

    Add an Alert Rule so that notification can occur as soon as a web service Web Services DIAGNOSTICS CONNECTION STRINGS response fails. Currently it only Alerts on Uptime and Response Times which are averaged over a minimum of 15 minutes.

    1 vote
    Vote
    Sign in
    (thinking…)
    Sign in with: Microsoft
    Signed in as (Sign out)
    You have left! (?) (thinking…)
    under review  ·  0 comments  ·  Flag idea as inappropriate…  ·  Admin →
  20. Scale page > instances graph is incorrect after using "Specific Dates" feature

    As a test, I used the "Specific Dates" feature yesterday to scale up the # of instances for a web role from 1 -> 10 for just 2 hours. The role scaled back down to 1 instance after that interval. However, today the graph shows the historical # of instances pegged at 10 for the previous 5 days, which is incorrect. See attached image.

    0 votes
    Vote
    Sign in
    (thinking…)
    Sign in with: Microsoft
    Signed in as (Sign out)
    You have left! (?) (thinking…)
    0 comments  ·  Flag idea as inappropriate…  ·  Admin →
  • Don't see your idea?

Diagnostics and Monitoring

Categories

Feedback and Knowledge Base