Forum Replies Created

Viewing 15 posts - 241 through 255 (of 258 total)
  • Author
    Posts
  • Telium Support Group
    Moderator
    Post count: 263
    in reply to: Logs not rotating #6617

    The HAAst log file or its parent directory has world write permissions, and newer versions of logrotate do not allow this to rotate. Manually running logrotate shows the results below:

    $logrotate -d -v haast
    reading config file haast
    Handling 1 logs
    rotating pattern: /var/log/haast after 1 days (7 rotations)
    empty log files are rotated, old logs are removed
    considering log /var/log/haast
    error: skipping “/var/log/haast” because parent directory has insecure permissions

    The solution is to uncomment the line in the /etc/logrotate.d/haast file to allow rotating regardless of permission:

    su root root

    Telium Support Group
    Moderator
    Post count: 263

    HAAst performance data written to the database is recorded using the timetamp of the source system, and they are displayed in the order they arrived. If entries from different servers appear out of order, that means that the clocks on the two servers are out of sync.

    The solution is to install the network time service (NTP) on both peers and ensure they synchronize from the same source.

    Telium Support Group
    Moderator
    Post count: 263

    Something significant has changed in regards to the computer’s hardware profile. If you are attempting to move your license to a new hardware platform please contact support@telium.io for assistance.

    If you are not moving your license to another hardware platform and have not changed the hardware in any way then something else has gone wrong and we’re here to help. This symptom has been reported with HAAst installed in a virtual machine and the VM has undergone a virtual hardware revision upgrade. Similarly, if you have modified various BIOS settings your system might report hardware differently.

    As a workaround restart the HAAst service, or restart the virtual machine guest. Please generate a new license request and send it, along with a haast log file showing your failed restart attempt (including the license rejection messages), and we will generate a new key for you and help you figure out what changed (so long as the system still has a maintenance agreement in place).

    Please note that physical and virtual machines do not just change their configurations on their own – so be careful you don’t invalidate your license by experimenting with hardware or VM settings. Activate your license only after you have finished configuring your hardware.

    Telium Support Group
    Moderator
    Post count: 263

    You are experiencing a network/routing issue somewhere on your network. It could be an ARP table is corrupt somewhere, or a device is simply not routing packets out the interface it should. This type of problem is common with multihoming (this is not a HAAst issue, it’s a network issue).

    A complete solution is beyond the scope of this document. In fact, understanding and diagnosing ARP cache and routing problems is complex – don’t expect a simple answer. If you want to ensure that your Asterisk server is sending packets out the right interface on a multihomed system, you can set an iproute policy as follows:

    1. edit the /etc/iproute2/rt_tables file and add the following line:

    200 haastrt


    2. add the following two lines to your asterisk.start.pre file (assuming the device you are dynamically creating is called eth0.haast):

    ip rule add dev eth0.haast table haastrt
    ip route add default dev eth0.haast table haastrt


    3. Reboot your system and retry the ping test mentioned above.

    If the problem remains then the most likely cause if your router.

    Telium Support Group
    Moderator
    Post count: 263

    The snooze times for promote/demote for one or both peers is too short.

    For each peer, time how long the peer requires to fully promote and fully demote (include pre/post scripts). Then add 5 seconds to each (or more depending on variability of the system performance) and set the promote and demote snooze times accordingly.

    Restart HAAst and retry manual failover again. If dual active/standby contention continues then increase the snooze times in steps of 5 seconds and repeat.

    Telium Support Group
    Moderator
    Post count: 263

    The HAAst socket file is not accessible by the web server process. (Assuming HAAst is in fact running)

    Check permissions of the socket file and the directory containing the file. As of HAAst version 2.2 the socket file is stored in the /run directory. In previous
    versions of HAAst the socket file was stored in the /tmp directory.

    Telium Support Group
    Moderator
    Post count: 263

    Although unusual, this suggests that your FreePBX AstDB is corrupt or incomplete, and FreePBX is missing database entries needed by the dialplan. This can occur if you have installed a buggy FreePBX update, or FreePBX has crashed, etc.

    There are four possible ways to correct the Asterisk database.

    1. Restore the node from a backup
    2. Request that FreePBX regenerate the Asterisk database using this BASH command (after stopping Asterisk). Note that some versions of FreePBX don’t support this command:

      curl https://SERVER/freepbx/admin/config.php?type=setup&display=devices&action=resetall

    3. Recreate the Asterisk DB manually.
      • Stop FreePBX
      • Delete the /var/lib/asterisk/astdb.sqlite3
      • Start FreePBX
      • Edit & Submit each user
      • Click apply
    4. Stop FreePBX on both peers and copy the file /var/lib/asterisk/astdb.sqlite3 from the working peer to the malfunctioning peer. (Be sure Asterisk is stopped on both systems before trying this).

    Once corrected, be sure to resolve the underlying FreePBX/configuration/etc issue. As well, ensure that you have set the Asterisk database to synchronize in HAAst’s configuration file.

    Telium Support Group
    Moderator
    Post count: 263

    After looking at your system we can see that you mistakenly updated the PBX connection information in the problematic phone sets. The Call Continuity module sits between Asterisk and the phone sets transparently (it uses the same IP and Port that Asterisk did, so phones should not see a difference). In your case you changed some phones sets to connect to the cluster’s SIP on port 15060 (the Asterisk SIP port), but they should be connecting to port 5060 (the HAast CC module SIP port).

    In the following diagram, you can see how you are moving from the top topology, to the bottom topology. This is *transparent* to phone sets – do not change their SIP configuration information. Only Asterisk SIP settings have changed (to route SIP traffic through the HAast CC module).

    HAast

    Telium Support Group
    Moderator
    Post count: 263

    Error code 5 reflects a non-descript exit condition and is harmless.

    In versions of HAAst prior to 2017 a warning with exit code 2 (in the case of MySQL database/table synchronization) means that differences were found between the source and destination tables but they were successfully synchronized. Other exit codes in this context are 0 (success, no differences), 1 (error condition), and 3 (combination of 1 and 2)

    In version of HAAst from Jan 2017 onwards exit code 0 always means success (even if differences were found and then synced successfully).

    No action is required; this is a normal exit code and indicates synchronization is working. We have notified the author of the library we use in the hope of a more appropriate exit code.

    Telium Support Group
    Moderator
    Post count: 263

    FreePBX includes a backup script which causes high CPU and IO latency when it runs. During this time the entire system becomes unresponsive, and call quality may suffer. HAAst will accurately detect that the server has become unresponsive and initiate a failover.

    Customers don’t usually realize the system degradation caused by the FreePBX backup script as the script normally runs late at night. However, if the CPU and/or IO latency become severe, processes may hang/crash/use 100% CPU.

    The solution is to start by dealing with the root cause: disable the FreePBX backup script. This script is known to cause a variety of other system related issues. Instead use a high quality backup program (free or commercial). If that is not an option, Telium can assist you in modifying your setup to allow the script to run better behaved.

    Next, consider only backing up the standby peer (letting your backup program determine which is active). Finally, you can increase the priority of the HAAst process by increasing the [common] processpriority value by 2 (or higher if that is not sufficient).

    Telium Support Group
    Moderator
    Post count: 263
    in reply to: GLIBC error #6606

    The wrong HAAst package has been installed. HAAst is looking for libraries that are more modern (recent) or old than those offered by your Linux distribution.

    If you are sure you downloaded the right HAAst package then try a system wide update using your package manager (eg: “yum update”). Otherwise, return to the Telium web site and download a more suitable HAAst package.

    If you don’t see a HAAst package that exactly matches your Linux distribution, try downloading the package for the oldest Linux distribution. The LTS (Long Term Support) versions of versions of Red Hat (eg: v6) and Ubuntu (eg: v12) are usually the best packages to try.

    Telium Support Group
    Moderator
    Post count: 263
    in reply to: Untarring error #6605

    The library you downloaded is incomplete and/or corrupt. Try again to download the file. Before expanding the archive, check the md5sum
    value of the downloaded file and compare it to the value shown on the Telium website for that file.

    Some users find that their browser cannot reliably download files – in which case we recommend using FTP instead to get the file.

    Telium Support Group
    Moderator
    Post count: 263

    This problem can occur upon initial installation, or after an upgrades (if you deleted the contents of your /usr/local/haast/web_interface directory), and you have configured Apache to require a password when browsing the (virtual) directory. The problem is that your password file was not created (or was deleted during your upgrade). Please review section for 3.4.5 for details, but in summary you can create the
    missing password file as follows:

    htpasswd -c /usr/local/haast/web_interface/.htpasswd user1

    which will also prompt you for a password for the newly created user account named ‘user1’.

    Telium Support Group
    Moderator
    Post count: 263

    There is a known bug (in Ubuntu/Debian) relating to constantly changing MAC addresses for bonded interfaces. Further information can be found here: http://blog.widodh.nl/2015/09/ubuntu-and-the-changing-mac-address-withbonding/ or found on the Ubuntu bug tracker as bug 1288196.

    The solution is that you must adopt one of the workarounds identified in the above article to avoid problems with HAAst functionality. Telium’s recommendation is to fix the MAC address for the bonded interface as follows (as recommended in the article):

    auto bond0
    iface bond0 inet manual
    hwaddress fe:80:12:04:6d:6f
    bond-slaves none
    bond-mode 4
    bond-miimon 100
    bond-updelay 5
    bond-downdelay 5

    and if your distro permits, re-use a MAC address for the bondX interface which matches one of the underlying physical MAC addresses.

    Telium Support Group
    Moderator
    Post count: 263

    Linux is complaining that something is missing. The most likely cause is that you are missing a prerequisite package. HAAst then tries to access a missing operating system library which crashes the program.

    If you enabled the HAAst performance database option (writing to MySQL), but you skipped the installation steps involving MySQL drivers then you might see this error. The solution is to repeat/complete the steps in the Detailed Installation Guide relating to MySQL (sections 3.2 and 3.3)

    If that’s not it, then recheck each prerequisite package as you likely missed one.

    Telium’s engineering group is always interested to gather segfault information – so if you can provide direct SSH access we will take a look. (There is no cost to you, we just want to gather details). A segfault is rare and usually do to Linux prerequisites missing or misconfigured, but this is a top priority for our engineering team.

Viewing 15 posts - 241 through 255 (of 258 total)