Netbackup Troubleshooting

November 17, 2017 | Author: Vijaykumar Shilpa | Category: Backup, Computer File, Server (Computing), Java (Programming Language), Computer Data Storage
Share Embed Donate


Short Description

Download Netbackup Troubleshooting...

Description

S.NO

Status Code

Status

1

58

The server was unable to connect to the client.

2

59

access to the client was not allowed(59)

3

191

duplicate exited with status 191

4

50

import failure-phase 2

5

96

unable to allocate new media for backup Check for available medias in goodies-available

Frozen state reason--Backup exec tapes was loaded,NBU reads it have a BE data & mark it as FROZE 6

1

Partially success

7

74

The bpstart_notify script on the client takes too long.

8

2

none of the requested files were backed up(2)

9

156

snapshot error encountered

10

13

network issue

11

41

network issue

12

25

network issue

13

24

network issue

14

14

file write failed

status 2009 All compatible drive paths are down but media is available 129

(Disk storage unit is full)

status 29/141

catalog job failed

Status 14

file write failed)

Issue can't connect to client(58)--Network issue

or the media server tries to access the client, but the client does not recognize the server as a valid server. duplicate images was not happened due to lack of tapes in the librray http://www.symantec.com/business/support/index?page=content&id=TECH28499 There is no tapes for backup

as loaded,NBU reads it have a BE data & mark it as FROZEN state

ob detail & proceed as per the error.always check for the removable storage services should be disabled. client timed out waiting for bpstart_notify to complete catalog job failed due to lack of tape

Found some Events 1049 and 1284 on server AURHOM0002 event viewer

due to network issue

All compatible drive paths are down but media is available

We have modified the registry entry “HKEY_LOCAL_MACHINE\SOFTWARE\Veritas\NetBackup\CurrentVersion\Config\AUTHE We have removed all the Trusted Domain entry leaving only 2 , One with Master Server Name & Nestle kept as it is. UKYORM0015 "ADDED AUTOMATICALLY" WINDOWS ukyorm0015.nestle.com 0 NESTLE "ADDED AUTOMATICALLY" WINDOWS ukyorm0015.nestle.com 0 I\O error for files & folders

Troubleshooting -solution Check the Network connection & check the server is reachable or not Need to authorise the client server & then check Overwritable tapes & should monitor the duplication jobs Don’t logoff your session until the import job completes send market to load the tapes NA NA NA Change the value in bp start notify attributes Overwritable tapes & should monitor the catalog jobs

network issue

HINE\SOFTWARE\Veritas\NetBackup\CurrentVersion\Config\AUTHENTICATION_DOMAIN”

nly 2 , One with Master Server Name & Nestle kept as it is.

orm0015.nestle.com 0

15.nestle.com 0 go to host properties/matsre server/client attrubutes/use media server for duplication.

IF the status code is not available then write a short description of the problem such as Robot Down, Service Down, Server D Please follow the below ticket title update for NBU tickets

EUR SCOM HQVEVM0024 CHLSNM0004 CHLSNA0101 STATUS 14

bot Down, Service Down, Server Down, Dedup Down, GWAN Down etc. etc. If you have any doubt please check with your shift lead.

e check with your shift lead.

Please find simple useful commands for SLP monitoring on NBU, Refer attached doc as it has more information. Action To check the ctime from backup id To inactivate all further duplication jobs To activate all duplication jobs To inactive further duplications for specific SLP To activate all further duplication jobs to view the status of all duplication jobs to view the list of incomplete duplication jobs to inactivate the STU for duplications only to activate the STU for duplications To cancel a specific duplication job cd "Program Files\VERITAS\NetBackup\bin" cd "Program Files\VERITAS\NetBackup\bin\goodies" To check the available medias in the tape library C:\Program Files\VERITAS\NetBackup\bin\goodies>available_media.cmd

to view the list of incomplete duplication jobs C:\Program Files\VERITAS\NetBackup\bin\admincmd>nbstlutil stlilist image_incomplete -U

To check the client connectivity between the master & the client C:\Program Files\VERITAS\NetBackup\bin\admincmd>bptestbpcd -client ptlism0002.nestle.com

To know abot the ctime -use below cmd C:\Program Files\VERITAS\NetBackup\bin>bpdbm -ctime 1338150733 1338150733 = Sun May 27 22:32:13 2012

C:\Program Files\VERITAS\Volmgr\bin>vmoprcmd.exe -d

C:\Program Files\VERITAS\Volmgr\bin>vmdelete.exe -m (Media ID)

C:\Program Files\VERITAS\NetBackup\bin\admincmd>nbstlutil stlilist -image_incomplete C:\Program Files\VERITAS\NetBackup\bin\admincmd>hostname FRBVRM0001 C:\Program Files\VERITAS\NetBackup\bin\admincmd>

Reauthenticate on NBU C:\Program Files\VERITAS\NetBackup\bin>bpnbat.exe Stop the duplication C:\Program Files\VERITAS\NetBackup\bin\admincmd>nbstlutil inactive -lifecycle DKKOLM0000_DKCPHM0004_LP01_AIR c:\Program Files\VERITAS\NetBackup\bin\admincmd>nbstlutil inactive -lifecycle all To list all SLP jobs, nbstlutil stlilist dkkolw0001.nestle.com_1337623200 nbstlutil -wait inactive dkkolw0001.nestle.com_1337623200 nbstlutil inactive -lifecycle $DKKOLM0000_DKCPHM0004_LP01_AIR

nbstlutil -wait inactive -lifecycle SLP_DKKOLM0000_DKCPHM0004_LP01_AIR -backupid dkkolw0001.nestle.com_1337623200 nbstlutil -wait inactive -backupid dkkolw0001.nestle.com_1337364000 nbstlutil stlilist -U nbstlutil stlilist nbstlutil stlilist -image_incomplete -U to find ctime : C:\Program Files\VERITAS\NetBackup\bin>bpdbm -ctime 1337374479

FOR ROB test C:\Program Files\VERITAS\Volmgr\bin>vmoprcmd C:\Program Files\VERITAS\Volmgr\bin>robtest

Robot Selection --------------1) TLD 0 2) none/quit Enter choice: 1 1 Robot selected: TLD(0) robotic path = {3,0,0,1} Invoking robotic test utility: C:\Program Files\VERITAS\Volmgr\bin\tldtest.exe -rn 0 -r {3,0,0,1} Opening {3,0,0,1} MODE_SENSE complete Enter tld commands (? returns help information) sd drive 1 (addr 1) access = 1 Contains Cartridge = no drive 2 (addr 2) access = 1 Contains Cartridge = no drive 3 (addr 3) access = 1 Contains Cartridge = no drive 4 (addr 4) access = 1 Contains Cartridge = yes Source address = 1005 (slot 5) Barcode = BDC916L5

READ_ELEMENT_STATUS complete ss slot 1 (addr 1001) contains Cartridge = no

slot 2 (addr 1002) contains Cartridge = no slot 3 (addr 1003) contains Cartridge = no slot 4 (addr 1004) contains Cartridge = no slot 5 (addr 1005) contains Cartridge = no slot 6 (addr 1006) contains Cartridge = yes Source address = 1006 Barcode = BDC917L5 slot 7 (addr 1007) contains Cartridge = yes Source address = 1007 Barcode = BDC918L5 slot 8 (addr 1008) contains Cartridge = yes Source address = 1008 Barcode = BDC919L5 slot 9 (addr 1009) contains Cartridge = yes Source address = 1009

Barcode = BDC912L5 slot 10 (addr 1010) contains Cartridge = yes Source address = 1010 Barcode = BDC913L5 slot 11 (addr 1011) contains Cartridge = yes Source address = 1011 Barcode = BDC914L5 > q READ_ELEMENT_STATUS complete q Robot Selection --------------1) TLD 0 2) none/quit Enter choice: 2 2 C:\Program Files\VERITAS\Volmgr\bin> The command to move a tape from drive 1 to slot 15 is : m d1 s15

To check the logs: nbrbutil -dump To release the jobs: nbrbutil -releaseMDS 10911

10.196.208.195 administrator 20121964 Stuck media in the Tape drive Library is offline please to apply troubleshooting and provide feedback.

Remove the stuck media ·

To power library off

·

To check cable are properly connected from server to library

·

To turn on the Library.

·

To load scratch tapes

To check the status of the SLP (Storage Lifecycle Policy) if it is in Active or Inactive mode. Please run command under: \Installation_path ...\Netbackup\bin\admincmd>Nbstl -L Something like this mentioned below will be the output of the command. It’ll show you complete details of the SLP.

In few servers, “Nbstl –L” is not working showing error that no entity found. In such case please run the command with the storage Lifecycle policy name, \Installation_path ...\Netbackup\bin\admincmd>Nbstl EGCAIM001_LP01 –L

tached doc as it has more information. command bpdbm -ctime nbstlutil inactive -lifecycle all nbstlutil active -lifecycle all nbstlutil -wait inactive -lifecycle DKKOLM0000_DKCPHM0004_LP01_AIR nbstlutil active -lifecycle DKKOLM0000_DKCPHM0004_LP01_AIR nbstlutil stlilist -U nbstlutil stlilist -image_incomplete nbstlutil inactive -lifecycle nbstlutil active -destination nbstlutil cancel -backupid

to check the communication-bw cl & Mas

to check the duplication jobs

ptlism0002.nestle.com

To activate particular backup id C:\Program Files\VERITAS\NetBackup\bin\admincmd>nbstlutil.exe active -backupid n lnunw0001.nestle.com_1346436015 -force

For deleting the media from EMM

age_incomplete

Rob Test C:\Program Files\VERITAS\Volmgr\bin> vmoprcmd robtest

lifecycle DKKOLM0000_DKCPHM0004_LP01_AIR

AIR -backupid dkkolw0001.nestle.com_1337623200

1337374479

To check the incomplete Job list

Check for orphaned device allocation. Run this command from cmd: nbrbutil -dump Check the 'MDS Allocation' output at the bottom. Orphaned allocations can be released as follows: nbrbutil -releaseMDS or, if no backups are running, reset all: nbrbutil -resetAll (command is in \veritas\netbackup\bin\admincmd)

Inactive mode.

Path to be executed in C:\Program Files\VERITAS\NetBackup\bin> c:\Program Files\VERITAS\NetBackup\bin\admincmd> c:\Program Files\VERITAS\NetBackup\bin\admincmd> c:\Program Files\VERITAS\NetBackup\bin\admincmd> c:\Program Files\VERITAS\NetBackup\bin\admincmd> c:\Program Files\VERITAS\NetBackup\bin\admincmd> c:\Program Files\VERITAS\NetBackup\bin\admincmd> c:\Program Files\VERITAS\NetBackup\bin\admincmd> c:\Program Files\VERITAS\NetBackup\bin\admincmd> c:\Program Files\VERITAS\NetBackup\bin\admincmd>

telnet czprgd0002 bpcd bpclntcmd -hn czprgd0002

Checked & found the last night daily backup job nlnunl0010 was completed successfull.Hence we close this IM as complete. APPLICATION, INFRASTRUCTURE AND SYSTEM MANAGEMENT/BACKUP - WINDOWS/NETBACKUP nbstlutil stlilist -image_incomplete -U

bpimmedia.exe -mediaid IRD705 -l

n\admincmd>nbstlutil.exe active -backupid n

To check the status of the Drive To move the voume from one slot to another

Hence we close this IM as complete.

Approximate calculationof overwritable tapes--Calculate the data size & ask market for tapes capacity LTO4 1TB LTO5 2TB

Backup completed

ZONE EUR EUR EUR EUR EUR EUR EUR EUR EUR EUR EUR

MasterServer ITMILM0007 ITMILM0007 ITMILM0008 ITPORM0001 MTVALM0001 NLAMNW0000 PLWARM0005 PLWAWM0001 PLWAWM0001 PLWAWM0001 PLWAWM0001

Server ITBERW0000 ITUDIW0000 ITCBUW0001 ITPORL0002 MTVALH0002 NLAMNM0000 PLWARD0018 PLWAWW0004 PLWAWB0000 PLWAWA0020 PLWAWH0001

-F-F--F F-FFF-FFF -F-FFFF F--

Backup completed UKCROM0015 ESBCNM0000

C:\WINNT\system32\NtmsData C:\Temp\Eicar

Status Siddesh is working on the issue Siddesh is working on the issue Siddesh is working on the issue Backup completed Working on the ticket IM0006540293 Backup completed Backup completed IM0007341781- PLWAWM0001-Tape-Drive offline Backup completed Backup completed Backup completed

160.213.34.90

8 zeros

193.148.192.64

8 Zeros

\system32\NtmsData

Netbackup Tips Glossary Term CLI GUI Media Server Master Server

Starting and Stopping Netbackup Stopping Netbackup /usr/openv/netbackup/bin/K77netbackup --> graceful shutdown /usr/openv/netbackup/bin/bpps -a --> check for any remaining processes /usr/openv/netbackup/bin/goodies/bp.kill_all ---> kills all remaining netbackup processes, not necessarily graceful /usr/openv/netbackup/bin/bpps -a --> check for any remaining processes kill -9 for any remaining. NOTE: unkillable processes may require a reboot Starting Netbackup /usr/openv/netbackup/bin/S77netbackup --> after bp.kill_all, to restart

Common Tasks Starting the Administration GUI java from the windows client x-windows from the server - /usr/openv/netbackup/bin/xnb & Checking Backup Status Activity Monitor or /usr/openv/netbackup/bin/admincmd/bpdbjobs -report Cleaning a tape manually Identify the drive name to be cleaned tpclean -L Manually clean the drive: tpclean -C Determining what tapes were used for a backup

GUI Backup and Restore --> Find the file system --> Preview Media Button CLI Find the correct backup images bpimagelist -U -client -d -e Find the media used for those images bpimagelist -U -client -d -e -media Listing the files in a backup Find the tape(s) used (above procedure using bpimagelist) cd /usr/openv/netbackup/db/jobs/done Run the following script and redirect it's output to a text file: for file in `grep MOUNTING *|grep |awk '{print $1}'|sed 's/:MOUNTING//'` do echo $file grep PATH_WRITTEN $file|awk '{print $3}' echo " " echo "==========================================End of Image======================================" echo " " done

This process works for NBU V3.4: cd /usr/openv/netbackup/db/images/ ls -ltr --> this will identify the directory with the proper date verify directory with "bpdbm -ctime cd ls -ltr --> lists all of the backups for this client on this date cat __.f | awk '{print $10}' --> this prints out the files in the backup

For NBU > V3.4 bpflist --help --> undocumented netbackup command to list files from a binary .f file

Inventory the Robot

Inventory Robot --> /opt/openv/volmgr/bin/vmcheckxxx -rt robot_type -rn robot_number -list (where robot_type is tld, a Inventory Robot and Update Configuration --> /opt/openv/volmgr/bin/vmupdate -rt robot_type -rn robot_number -list (w Listing Properties of the Volume Pools vmpool -listall

Scratch Tapes Count scratch tapes: /usr/openv/volmgr/bin/vmquery -pn Scratch | grep -c "robot slot" Moving tapes to the scratch pool If Needed - Expire the tape bpexpdate -ev -d 0 -force -host Move the tape vmchange -p 2 -m Checking Drive Usage /usr/openv/volmgr/bin/vmoprcmd Taking a drive down or up /usr/openv/volmgr/vmoprcmd -down /usr/openv/volmgr/vmoprcmd -up

Performing a Restore From the GUI user backup & restore --> configuration --> client user backup & restore --> configuration --> client to restore directory to search directory depth date range file --> browse backups for restore Adding New Tapes to the Library Using the GUI Media Management --> Actions --> New --> Single Volume . . -->

Media Type (ie DLT) Robot Type (ie TLD) Media ID (from Inventory) Slot Number (from Inventory) Robot Number (ie 0) Volume Group Volume Pool (ie Scratch) Using the CLI vmadd -m -mt -verbose -rt -b -rn -rc1 -p lists all pools, both name and number For example: vmadd -m 000151 -mt dlt -verbose -rt tld -b 000151 -rn 0 -rc1 8 -p 2 -mm 0

Re-using Tapes from other systems or older Netbackups Expire the media bpexpdate -ev MEDIA_ID -d 0 -force -host HOST Deassign the media vmquery -deassignbyid MEDIA_ID 4 0 Move to the scratch pool vmchange -m MEDIA_ID -p POOL# Relabel the media bplabel -ev CIM572 -d dlt -p Scratch Changing the attributes of media Changing the barcode vmchange -barcode CYM100D -m CYM100 Changing the Volume Pool vmchange -m MEDIA_ID -p POOL#

To expire media bpexpdate -ev -d 0 -force -host To unfreeze media List the frozen media /usr/openv/netbackup/bin/goodies/available_media | grep -i FROZEN Unfreeze the media bpmedia -unfreeze -ev -h To relabel a tape bplabel -ev -d -p bplabel -ev 000687 -d dlt -p TriVrgt_OFFSITE To remove media from the Netbackup database Verify that there are no images on the tape bpimmedia -mediaid 000687 -L Expire the tape bpexpdate -ev 000687 -d 0 -host scorpius -force Get the status and pool number of the tape vmquery -m 000687 Deassign the tape vmquery -deassignbyid

vmquery -deassignbyid 000687 4 0x0 Delete the tape vmdelete -m 000687

Installing the Netbackup Client /update_clients -ForceInstall -ClientList /tmp/clients.lst requires that TMPDIR and TEMPDIR be set correctly Excludng files from backup on a client Create /usr/openv/netbackup/exclude_list Put the file specifications of the files/directories to be excluded /mnt/directory/* Displaying Information about a Tape vmquery -m --> Displays attributes about a particular tape bpmedialist -U -mcontents -ev 000687 --> Displays media contents bpmedialist -U -mlist --> List of all media bpmedialist -U -mlist -ev CYM966 --> Listing of a particular media id bpimmedia -mediaid 000687 -L --> Listing of images on a tape

Robtest Commands Starting robtest robtest 1 --> to select TLD 0 Getting help ? Looking at contents of the tape drives sd Looking at the contents of the library ss Moving a tape from a drive to a library slot s d --> to identify drive number that has tape (Contains Cartridge = yes, Barcode=XXXXXX) s s --> to identify an empty slot in the tape library (Netbackup will need to be re-inventoried) m d# s# --> from from drive # to slot # s d --> verify the tape drive is empty s s --> verify the library slot has the tape

Configuration Files /usr/openv/netbackup/bp.conf configuration file, sets backup server and backup clients force statement must be correct client to browse from client to restore to /usr/openv/volmgr/vmconf

Logfiles To utilize logfiles, create the corresponding directory in /usr/openv/netbackup/logs Server Logfile directories: admin - adminstrative commands bpbrm - backup and restore manager bpcd - client daemon bpdbjobs - database manager program process bpdm - disk manager process bpjava-msvc - Java application server authentication service bpjava-usvc - process that services Java requests bprd - request daemon process bpsched - scheduler process that runs on master servers bptm - tape/optical media management process user-ops - required directory for use by Java programs xbpadm - X based administration utility xbpmon - X based job monitor process Client Logfile directories: bp - client user interface process bparchive - archive program bpbackup - backup program bpbkar - program that generates golden images bpcd - client daemon bpjava-msvc - Java application server authentication service bpjava-usvc - process that services Java requests bplist - program that lists backed up and archived files bpmount - program that determines local mountpoints and wildcard expansion for multiple streams bphdb - Oracle database backup program start process

db_log - database specific extension log tar - tar process log during restores user_ops Media Manager logging automatically goes to the system log using syslogd logging facility

.Logging will only occur if these directories are created. These directories will generate a lot of data and should be deleted w To increase the amount of logging information set VERBOSE=2 in /usr/open/netbackup/bp.conf (default is VERBOSE=1)

Processes ltid acsd vmd

Useful Commands bpcllist - list classes bpclinfo -L --> displays info about a class vmpool - volume pools vmpool -listall vmpool -listscratch bplabel -ev -d hcart bpbackup db --> backs up the catalog bpclclients --> lists the clients for a particular policy (class)

Troubleshooting bperror -statuscode this will bring up drive 0 if it's control shows as down Look for pending requests /usr/openv/volmgr/bin/vmoprcmd or gui --> device management If there is a pending request either re-assign it to a drive, or deny the request Downed drive does not come back up or does not stay up

Check for a hardware problem by looking for messages on the tape library Make sure there is not a tape stuck in the drive Use robtest (described above) to look at the drives If there is a tape stuck in the drive, try to remove it using robtest If robtest fails, then you must manually remove it. Verify the Client is communicating properly: bpclncmd -ip --> from both client and server bpclntcmd -hn --> from both client and server bpclntcmd -pn --> from client only

Device Actions Device Management --> info about tape drives dlt hcart (ultrium)

Media Actions Media id must agree with # of the tape Create a media id actions -->new-->single volume-->dlt cart (not dlt2) put it into the "netbackup" volume pool

Netbackup Client To check things out do this: It could be a couple things. Mostly DNS, bp.conf, or something stupid. On the client run this command /usr/openv/netbackup/bin/bpclntcmd -pn /usr/openv/netbackup/bin/bpclntcmd -server "server name" /usr/openv/netbackup/bin/bpclntcmd ip "ip_address"

One of these usually fails and your able to fix it right off

1074 ./bpclntcmd -hn corpbu1 1075 ./bpclntcmd -ip 10.194.1.129 1076 ping 10.194.1.129 1077 ./bpclntcmd -hn corpldv1 1078 ./bpclntcmd -hn corpbu1.corporate.vox.net 1079 ping corpldv1 1080 ./bpclntcmd -ip 10.194.1.120

Must be able to resolve correctly from the master server and the client or it will not work!!!

Definition Command Line Interface Graphical User Interface

essarily graceful

================="

here robot_type is tld, acs, . . .) robot_number -list (where robot_type is tld, acs, . . .)

-rc1 -p -mm

nd should be deleted when no longer necessary.

ult is VERBOSE=1)

View more...

Comments

Copyright ©2017 KUPDF Inc.
SUPPORT KUPDF