Smart hard


Smart status bad backup and replace что делать?

Некоторые пользователи персональных компьютеров или ноутбуков при включении своего компьютера могут наблюдать сообщение smart status bad backup and replace. В ноутбуках HP это же сообщение имеет чуть другой вид — smart hard disk error 301, но имеет тот же самый смысл.

Причиной его появления является стечение двух обстоятельств:

  1. В настройках BIOS включена проверка S.M.A.R.T. жесткого диска;
  2. Сам S.M.A.R.T. жесткого диска имеет сообщения о неполадках.

Что значит Smart status bad backup and replace?

Если вы видите данное сообщение на своем компьютере или ноутбуке это значит лишь одно — жесткий диск (винчестер) с большой вероятностью в самое ближайшее время потребует замены.

Первое, что стоит сделать в данном случае это скопировать с жесткого диска все важные для вас данные на любой другой носитель, такой как флешка, облако или внешний жесткий диск.

Следующим шагом будет проверка того самого S.M.A.R.T. который является своеобразной информационной системой любого жесткого диска, отображающей все возникающие ошибки и проблемы.

Чаще всего возникают ошибки по позиции 05 Reallocated sector count. Количество переназначенных секторов превысило допустимое значение. Другими словами на жестком диске появилось много БЭД блоков.

При возникновении данной ошибки рекомендуется проверить жесткий диск на наличие BAD блоков.

Конечно можно продолжать использовать винчестер при таком сообщении S.M.A.R.T. но хранить на нем важные данные крайне не рекомендуется, так как в любой момент он может окончательно выйти из строя, что грозит потерей всех данных или их дорогостоящей процедурой восстановления.

Как убрать Smart status bad backup and replace?

Если вы не собираетесь менять жесткий диск и продолжать его использование, то данное сообщение можно попробовать отключить в настройках BIOS.

Для этого заходим в BIOS.

Далее ищем что — нибудь связанное со словом S.M.A.R.T. В разных версиях BIOS данный параметр может находиться в разных местах.

Часто настройка S.M.A.R.T. находится на начальном экране BIOS, где показаны все подключенные устройства IDE и SATA.

Как убрать Smart status bad backup and replace

Здесь нужно выбрать ваш жесткий диск клавишей «Enter» и выключить проверку S.M.A.R.T., переведя его в состояние Disabled.

Отключаем Smart status bad backup and replace в настройках BIOS

Также эта настройка может находиться во вкладке «advanced»

Smart settings

Отключаем smart hard disk error 301 hp

В любом случае отключение данного сообщения это не решение проблемы. Самое правильное — это менять жесткий диск, так как если сообщение о плохом состоянии СМАРТ появляется при включении, то это будет единственным правильным его устранением.

helpadmins.ru

HDD SMART monitoring, HDD SMART Capability, Hard Disk SMART

Другие идентичные названия опции: HDD SMART Capability, Hard Disk SMART.

Функция HDD SMART monitoring (S.M.A.R.T.-мониторинг жесткого диска) используется для включения технологии S.M.A.R.T., позволяющей пользователю осуществлять диагностику и мониторинг состояния жесткого диска при помощи прикладных программ. Опция может принимать два значения – Enabled (Включено) или Disabled (Выключено). Во многих BIOS данная опция также имеет название HDD S.M.A.R.T. Capability.

Содержание статьи

Принцип работы

S.M.A.R.T – это технология, предназначенная для контроля состояния жестких дисков персонального компьютера. Технология была разработана в середине 1990-x гг. и в настоящее время поддерживается большинством жестких дисков, а также материнских плат. Ее возможности включают быстрый поиск ошибок на диске и их исправление. Помимо локальных дисков технология S.M.A.R.T может использоваться также и для сетевых накопителей.

Суть этой технологии  состоит в том, что время от времени контроллер жесткого диска отсылает системе ряд данных, по которым можно судить о текущих параметрах и характеристиках накопителя и его отдельных узлов. К таким данным относятся, в частности, сведения о степени износа отдельных элементов накопителя и о состоянии поверхности магнитных дисков, на которых хранится информация.

Система S.M.A.R.T., как правило, не предполагает автономного мониторинга накопителей вне рамок какой-либо операционной системы. Для функционирования S.M.A.R.T. в операционной системе требуются два основных элемента – наличие в контроллере накопителя служебного ПО, собирающего информацию о накопителе, и наличие в самой ОС специализированных программ для получения данной информации.

Стоит ли включать опцию?

Если вы хотите использовать преимущества технологии S.M.A.R.T., в частности, пользоваться прикладными программами для мониторинга жестких дисков, то вам, естественно, необходимо включить данную опцию в вашем BIOS.

Однако существует ряд причин, по которым использование данной технологии могло бы быть нежелательным. Например, в том случае, если вы используете сетевые диски, то информация, посылаемая по сети системой S.M.A.R.T., может иногда приводить к перезагрузке системы. Если вы сталкиваетесь с такого рода проблемами, то опцию HDD SMART monitoring лучше всего отключить.

Функция S.M.A.R.T. является важной для большинства пользователей, однако, стоит помнить, что само по себе включение опции не является гарантией того, что система будет автоматически отслеживать состояние ваших жестких дисков. Для того, чтобы проводить мониторинг жестких дисков в операционной системе, необходимо использовать специальное программное обеспечение – программы для S.M.A.R.T.-мониторинга. Как правило, эти программы работают в фоновом режиме и получают информацию от жестких дисков в режиме реального времени.

Следует помнить, однако, что опция HDD SMART monitoring занимает какую-то часть системных ресурсов, поскольку при ее использовании система постоянно принимает пакеты данных от контроллера жесткого диска, что нагружает память и процессор.  То же самое можно сказать и о программах мониторинга. Поэтому не во всех случаях  процесс мониторинга при помощи S.M.A.R.T. может быть оправдан, тем более, что большинство современных жестких дисков обладает сравнительно высокой надежностью. К тому же, несмотря на то, что технология S.M.A.R.T. способна заблаговременно отслеживать ряд критических ситуаций с жесткими дисками и некоторые их предаварийные состояния,  тем не менее, она не всегда способна предсказать заранее неожиданный выход накопителя из строя.

Таким образом, использование программ S.M.A.R.T. – мониторинга не является панацеей от негативных последствий, которые несет в себе выход жесткого диска из строя и не заменит процедуры резервного копирования важных данных.

Применение опции HDD SMART monitoring, а также использование мониторинговых утилит может быть полезным, но оно может быть оправдано, как правило, лишь в случае применения ненадежных жестких дисков, или тех на которых хранится чрезвычайно важная информация. Следовательно, если вы не используете утилиты для мониторинга жестких дисков, то вы можете смело выключить данную опцию, чтобы избежать ненужного расходования системных ресурсов.

Также следует упомянуть о том, что многие BIOS имеют встроенную процедуру проверки жестких дисков при помощи технологии S.M.A.R.T., которая выполняется при загрузке компьютера. Если ваша материнская плата имеет подобную опцию, то включение функции S.M.A.R.T.-мониторинга позволит проводить быструю диагностику HDD во время загрузки. Однако следует иметь в виду, что эта процедура отнимает какое-то время во время загрузки компьютера и, к тому же, она не сможет заменить специализированные программы для мониторинга и диагностики жестких дисков.

Порекомендуйте Друзьям статью:

biosgid.ru

PassMark DiskCheckup - SMART hard drive monitoring utility

PassMark DiskCheckup ™

Versions:V3.4 (Build 1003)

Latest release date:3 Feb 2017

Price:Free for personal use. Company license is US$19.00 per license.

Platforms:Windows XP-SP3, 2003 Server, 2008 Server, Server 2012, Vista, Windows 7, Windows 8, and Windows 10.

Requirements:16 MB RAM, 2 MB of disk space.

Related softwareSysInfo DLL SDK

PassMark DiskCheckup™ allows the user to monitor the SMART attributes of a particular hard disk drive. SMART (Self-Monitoring Analysis and Reporting Technology) is a feature on a computer's hard disk for providing various monitoring indicators of disk reliability. If SMART is enabled on a hard disk, the system administrator can receive analytical information from the hard drive to determine a possible future failure of the hard drive.

SMART monitors elements of possible long term drive failure, such as 'Spin Up Time', the number of start/stops, the number of hours powered on and the hard disk temperature.

DiskCheckup displays the current values of the SMART attributes, along with the Threshold value for that attribute. If an attribute drops below its threshold, the drive cannot guarantee that it will be able to meet its specifications in the future.

Note that SMART attributes change slowly over time and are helpful attempts to diagnose the life span of a particular drive. DiskCheckup monitors these changes over a long period and predict the date (if available) of the Threshold Exceed Condition (TEC), which is displayed on the main window.

DiskCheckup can also execute built-in Disk Self Test (DST) routines implemented by the vendor to detect drive failures. There are two main self test routines: Short Test and Extended Test. The results of these tests are displayed in DiskCheckup.

DiskCheckup has the ability to detect and set the sizes of the Host Protected Area (HPA) and Device Configuration Overlay (DCO). The HPA/DCO are hidden areas of the hard disk that contain data not accessible by the user. These areas can be removed to reveal the data hidden within these areas.

DiskCheckup also displays device information, such as the drive geometry, serial number, model number, media rotation rate, and supported features. The real-time activity of the disk is also displayed and updated periodically.

DiskCheckup can be configured to perform e-mail notification when a SMART attribute has been detected to be less than the allowable threshold value. Such threshold values are determined by the hard disk manufacturer. For a drive to be considered "good", all the SMART attributes must be above these values. Different SMART attributes have different threshold values. For more configuration options, refer to the screen shot below.

A hard drive that supports SMART, plus compatible drivers. Most recent hard drives (SATA/USB/FireWire/PCIe M.2 NVMe) are OK but drives connected via SCSI or hardware RAID are not supported. Drives configured as software RAID (dynamic disks) via Windows Disk Management will also work.

DiskCheckup is free for personal use. Company licenses can be purchased for US$19.00 USD per license. Discounts apply for multiple licenses.

For a quote on multiple licenses or more information , please contact

DiskCheckup is built with PassMark's SysInfo DLL SDK. If you would like to use this technology in your application, please check out the SysInfo DLL SDK or contact us at

Starting with DiskCheckup V3.2, the software allows the users to translate the program into their own language. DiskCheckup will look for localization.txt file within the program folder and will use the strings in this file. If you have a translation that is not already available and have made a new translation file for it. Please send it in. We may possibly include it for future release of the software. Mail these files to .Note: If you edit localization.txt, make sure to make a backup as the file is replaced on subsequent installs.

Follow the download button above to download DiskCheckup

Double click on the DiskCheckup.exe installer file to run the DiskCheckup installer. Additional information is provided in the included HTML help files. You can view them from DiskCheckup's help button/menu or open the start page "index.html" in the "HELP" folder.

  • Hardware RAID and SCSI are not supported. But dynamic disks (software RAID) are supported.
  • The Silicon Image SIL0680 Ultra-133 ATA RAID Controller has a bug which can cause a system lockup when the SMART data is accessed.. This bug exists in the current driver version, 1.0.1.7 and presumably in previous versions.
  • TEC predictions about future failure dates should be taken as a guide only and should not be considered accurate.
  • The majority of newer drives connected via USB and Firewire are supported. However, older drives may not be supported due to the protocol bridge on the hard disk not supporting SMART commands)
  • Software Localization: SMART attribute list is not translated.

The SmartDisk DLL SDK and DiskCheckup utilizes statistical analysis to predict possible failure dates of hard disk drives. Because there are no 'certainties' in statistical analysis, PassMark® Software Pty Ltd disclaims all liability for any and all costs incurred by either:

1) The hard disk drive failing before the predicted failure date estimated by the application, or2) The hard disk drive continuing to function beyond the predicted failure date estimated by the application.

In either situation, PassMark® disclaims liability for any losses due to loss or damage to data. PassMark® further disclaims any liability for costs incurred in anticipation of a disk drive failure that does not eventuate (e.g. replacement hard disk drives, transfer time, downtime, etc).

This disclaimer is in addition to the Disclaimer of Warranty and Limitation of Liability mentioned elsewhere in the EULA and on this website.

The following language files are to be use with PassMark DiskCheckUp software (V3.2+). They provide localization to user's preferred language. Language files will be listed below until their possible inclusion into the official installer. To use a language file, you can download a file below and replace the "localization.txt" file in the program folder. Make sure to make a backup of the original if you want to revert back to the original installed language.

LanguageRequiresTranslated ByDownload
FrenchDiskCheckUp V3.2 or laterPierre le Lidgeu from Colok-Traductions TeamDownload (10KB)
RussianDiskCheckUp V3.2 or laterAlecs962 from wylek.ru TeamDownload (10KB)

www.passmark.com

SMART Hard Drive Monitoring @ Calomel.org

home rss search January 01, 2017

Using S.M.A.R.T. to monitor hard drive health

SMART is is defined as the Self Monitoring Analysis & Reporting Technology. It is a monitoring system for computer hard disks to detect and report on various indicators of reliability, in the hope of anticipating failures. In effect, SMART can be used to monitor the health of the hard drive.

Fundamentally, hard drives can suffer one of two classes of failures:

  • Predictable ones, when some failure modes, especially mechanical wear and aging, happen gradually over time. A monitoring device can detect these, much as a temperature dial on the dashboard of an automobile can warn a driver before serious damage occurs that the engine has started to overheat.
  • Unpredictable ones, when other failures may occur suddenly and unpredictably, such as an electronic component failing.

Mechanical failures, which are usually predictable failures, account for 60 percent of drive failure. The purpose of S.M.A.R.T. is to warn a user or system administrator of impending drive failure while time remains to take preventative action such as copying the data to a replacement device. Approximately 30% of failures can be predicted by S.M.A.R.T.

Work at Google on over 100,000 drives has shown little overall predictive value of S.M.A.R.T. status as a whole, but that certain sub-categories of information S.M.A.R.T. implementations might track do correlate with actual failure rates - specifically that following the first scan error, drives are 39 times more likely to fail within 60 days than drives with no such errors and first errors in reallocations, offline reallocations, and probational counts are also strongly correlated to higher failure probabilities.Wikipedia, SMART

The most basic information that SMART provides is the SMART status. It provides only two values, "threshold not exceeded" or "threshold exceeded". Often these are represented as "drive OK" or "drive fail" respectively. A "threshold exceeded" value is intended to indicate that there is a relatively high probability that the drive will not be able to honor its specification in the future: that is, it's "about to fail". The predicted failure may be catastrophic or may be something as subtle as inability to write to certain sectors or slower performance than the manufacturer's minimum.

The SMART status does not necessarily indicate the drive's reliability now or in the past. If the drive has already failed catastrophically, the SMART status may be inaccessible. If the drive was experiencing problems in the past, but now the sensors indicate that the problems no longer exist, the SMART status may indicate the drive is OK, depending on the manufacturer's programming.

Lets take a look at what SMART can do through the "atactl" binary in OpenBSD.

Using SMART to see the drives attributes

You can look at the attributes of our example drive by using the readattr as seen below in the scrollable window. These value cover most of the common functions of the hard drive including retry amounts and failure counts. This drive has not reported any errors.

[email protected]: /sbin/atactl /dev/wd0c readattr Attributes table revision: 16 ID Attribute name Threshold Value 3 Spin Up Time 63 176 4 Start/Stop Count 0 253 5 Reallocated Sector Count 63 253 6 Unknown 100 253 7 Seek Error Rate 0 253 8 Seek Time Performance 187 240 9 Power-on Hours Count 0 235 10 Spin Retry Count 157 253 11 Unknown 223 253 12 Device Power Cycle Count 0 253 192 Power-off Retract Count 0 253 193 Load Cycle Count 0 253 194 Temperature 0 25 195 Unknown 0 253 196 Reallocation Event Count 0 253 197 Current Pending Sector Count 0 253 198 Off-line Scan Uncorrectable Sect 0 253 199 Ultra DMA CRC Error Count 0 199

Using SMART to monitor the drive

You can use the binary "atactl" to monitor the SMART values of your hard drive too. In this example we are going to use the binary to check the primary hard drive of a OpenBSD system disk and notify us by email of any errors.

When using the one or both of the following options "atactl" will check the drive for errors. If an error is found then an email will be sent to root by means of the "smartenable" argument. It only sends out an email if an error is found to reduce the spam.

Option 1: Run Once - The following can be run SMART once when the system boots to check the primary hard drive. Put these lines into your /etc/rc.local to check SMART stats on boot:

## SMART hard drive boot check if [ -x /sbin/atactl ]; then echo -n ' smartenable'; /sbin/atactl /dev/wd0c smartenable fi

Option 2: Run Periodically through Cron - By runnning the SMART check through cron you can make sure you have a heads up of any serious problems with the drive. The following will run the command every morning at 5:30am. You will only receive an email to root if there is a problem with the drive.

#minute (0-59) #| hour (0-23) #| | day of the month (1-31) #| | | month of the year (1-12 or Jan-Dec) #| | | | day of the week (0-6 with 0=Sun or Sun-Sat) #| | | | | commands #| | | | | | #### SMART Hard Drive Status 30 5 * * * /sbin/atactl /dev/wd0c smartstatus >> /dev/null 2>&1

calomel.org

What SMART Hard Disk Errors Actually Tell Us

What if a hard drive could tell you it was going to fail before it actually did? Is that possible? Each day Backblaze records the SMART stats that are reported by the 67,814 hard drives we have spinning in our Sacramento data center. SMART stands for Self-Monitoring, Analysis and Reporting Technology and is a monitoring system included in hard drives that reports on various attributes of the state of a given drive.

While we’ve looked at SMART stats before, this time we’ll dig into the SMART stats we use in determining drive failure and we’ll also look at a few other stats we find interesting.

We use Smartmontools to capture the SMART data. This is done once a day for each hard drive. We add in a few elements, such as drive model, serial number, etc. and create a row in the daily log for each drive. You can download these logs files from our website. Drives which have failed are marked as such and their data is no longer logged. Sometimes a drive will be removed from service even though it has not failed, like when we upgrade a Storage Pod by replacing 1TB drives with 4TB drives. In this case, the 1TB drive is not marked as a failure, but the SMART data will no longer be logged.

SMART stats we use to predict Hard Drive failure

For the last few years we’ve used the following five SMART stats as a means of helping determine if a drive is going to fail.

Attribute Description
SMART 5 Reallocated Sectors Count
SMART 187 Reported Uncorrectable Errors
SMART 188 Command Timeout
SMART 197 Current Pending Sector Count
SMART 198 Uncorrectable Sector Count

When the RAW value for one of these five attributes is greater than zero, we have a reason to investigate. We also monitor RAID array status, Backblaze Vault array status and other Backblaze internal logs to identify potential drive problems. These tools generally only report exceptions, so on any given day the number of investigations is manageable even though we have nearly 70,000 drives.

Let’s stay focused on SMART stats and take a look at the table below which shows percentage of both failed and operational drives, which are reporting a RAW value that is greater than zero for the SMART stat listed.

While no single SMART stat is found in all failed hard drives, here’s what happens when we consider all five SMART stats as a group.

Operational drives with one or more of our five SMART stats greater than zero – 4.2%

Failed drives with one or more of our five SMART stats greater than zero – 76.7%

That means that 23.3% of failed drives showed no warning from the SMART stats we record. Are these stats useful? I’ll let you decide if you’d like to have a sign of impending drive failure 76.7% of the time. But before you decide, read on.

Having a given drive stat with a value that is greater than zero may mean nothing at the moment. For example, a drive may have a SMART 5 raw value of 2, meaning two drive sectors have been remapped. On it’s own such a value means little until combined with other factors. The reality is it can take a fair amount of intelligence (both human and artificial) during the evaluation process to reach the conclusion that an operational drive is going to fail.

One thing that helps is when we observe multiple SMART errors. The following chart shows the incidence of having one, two, three, four or all five of the SMART stats we track have a raw value that is greater than zero.To clarify, a value of 1 means that of the five SMART stats we track only one has a value greater than zero, while a value of 5 means that all five SMART stats we track have a value greater than zero. But, before we decide that multiple errors help, let’s take a look at the correlation between these SMART stats as seen in the chart below.

In most instances the stats have little correlation and can be considered independent. Only SMART 197 and 198 have a good correlation meaning we could consider them as “one indicator” versus two. Why do we continue to collect both SMART 197 and SMART 198? Two reasons: 1) the correlation isn’t perfect so there’s room for error, and 2) not all drive manufacturers report both attributes.

How does understanding the correlation, of lack thereof, of these SMART stats help us? Let’s say, a drive reported a SMART 5 raw value of 10 and SMART 197 raw value of 20. From that we could conclude the drive is deteriorating and should be scheduled for replacement. Whereas, if the same drive had SMART 197 raw value of 5 and a SMART 198 raw value of 20 and no other errors, we might hold off on replacing the drive awaiting more data, such as the frequency of the errors occurring.

Error Distribution

So far it might sound like we will fail a hard drive if we just observe enough SMART values that are greater than zero, but we also have to factor time into the equation. The SMART stats we track, with the exception of SMART 197, are cumulative in nature, meaning we need to consider the time period over which the errors were reported.

For example, let’s start with a hard drive that jumps from zero to 20 Reported Uncorrectable Errors (SMART 187) in one day. Compare that to a second drive which has a count of 60 SMART 187 errors, with one error occurring on average once a month over a five year period. Which drive is a better candidate for failure?

Another stat to consider: SMART 189 – High Fly Writes

This is a stat we’ve been reviewing to see if it will join our current list of five SMART stats we use today. This stat is the cumulative count of the number of times the recording head “flies” outside its normal operating range. Below we list the percentage of operational and failed drives where the SMART 189 raw value is greater than zero.

The false positive percentage of operational drives having a greater than zero value may at first glance seem to render this stat meaningless. But what if I told you that for most of the operational drives with SMART 189 errors, that those errors were distributed fairly evenly over a long period of time. For example, there was one error a week on average for 52 weeks. In addition, what if I told you that many of the failed drives with this error had a similar number of errors, but they were distributed over a much shorter period of time, for example 52 errors over a one-week period. Suddenly SMART 189 looks very interesting in predicting failure by looking for clusters of High Fly Writes over a small period of time. We are currently in the process of researching the use of SMART 189 to determine if we can define a useful range of rates at which errors occur.

SMART 12 – Power Cycles

Is it better to turn off your computer when you are not using it or should you leave it on? The debate has raged on since the first personal computers hit the market in the 80’s. On one-hand turning off a computer “saves” the components inside and saves a little on your electricity bill. On the other-hand the shut-down / start-up process is tough on the components, especially the hard drive.

Will analyzing the SMART 12 data finally allow us to untie this Gordian knot?

Let’s compare the number of power cycles (SMART 12) of failed drives versus operational drives.

At first blush, it would seem we should keep our systems running as the failed drives had 175% more power cycles versus drives that have not failed. Alas, I don’t think we can declare victory just yet. First, we don’t power cycle our drives very often. On average, drives get power-cycled about once every couple of months. That’s not quite the same as turning off your computer every night. Second, we didn’t factor in the age range of the drives. To do that we’d need a lot more data points to get results we could rely on. That means, sadly, we don’t have enough data to reach a conclusion.

Perhaps one of our stat-geek readers will be able to tease out a conclusion regarding power cycles. Regardless, everyone is invited to download and review our hard drive stats data including the SMART stats for each drive. If you find anything interesting let us know.

Andy has 20+ years experience in technology marketing. He has shared his expertise in computer security and data backup at the Federal Trade Commission, Rootstech, RSA and over 100 other events. His current passion is to get everyone to back up their data before it's too late.

Category:  Cloud Storage

www.backblaze.com


Смотрите также