Training Terminology
Home Up Zoo Regions

 

Glossary of Animal Training Terminology

Compiled by the Animal Behavioral Management Committee
of the American Association of Zookeepers, Inc.

Approximation - One small step, in a series of progressive steps that lead to the behavioral goal.

Aversive Stimulus - A stimulus whose termination or removal causes an increase in the frequency of behavior.

Avoidance Behavior - A behavior that increases in frequency because it postpones the delivery of an aversive stimulus (and is therefore negatively reinforced).

Balance of Reinforcement - The relationship between the reinforcing value of an action, whether positive or negative, with the percentage of positive vs. negative reinforcement given following a specific act. For example, the balance of reinforcement would positive if the behavior is followed by a positive consequence a high percentage of the time.

Behavioral Criterion - The level or behavioral response that must be met to earn reinforcement.

Behavioral Drift- Situation in which a previously trained behavior changes slightly over time, often due to slight inconsistencies in the trainers' cues, criteria, and reinforcement schedules.

Bridging Stimulus - A stimulus that can act as both an SD and a secondary reinforcer. It pinpoints the exact moment of reinforcement in a desired behavior. The "bridge" (often a clicker, whistle or word) communicates to the subject that it has performed correctly and signals that reinforcement is forthcoming. It "bridges" the gap between the time the correct response is given and the time the reinforcer is delivered.

Capture (Scan) - To place a naturally occurring behavior (or Unconditioned Response) under stimulus control by reinforcing the behavior as it occurs.

Chain Behavior - An individual response linking two or more individual behaviors in a specific order, elicited by a common discriminative stimulus (SD), or other stimulus. Each response acts as a conditioned reinforcer for the previous response and also provides the stimulus for the next behavior in the chain. The behavior is typically bridged at the end of the chain; the bridge serves as a conditioned reinforcer for the entire chain.

Classical Conditioning (Pavlovian or Respondent Conditioning) - A basic form of learning in which stimuli initially incapable of evoking a reflex response acquire the ability to do so through repeated pairing with other stimuli that are able to elicit that response; a method of behavior modification where the presentation of an unconditioned stimulus (US) causes an unconditioned response (UR), which is emitted reflexively. This type of conditioning does not involve any voluntary choices by the animal; the response or reaction is reflexive.

Conditioned Response (CR) - A type of learned response that occurs following the presentation of a specific stimulus.

Conditioned Stimulus (CS) - A signal that will elicit a specific response as a result of a learned association between that stimulus and that response.

Conditioning - A form of learning in which a change in behavior is due to changes or influences in the environment; the operation of associating an act or object with a stimulus to modify or mold behaviors.

Continuous Reinforcement - A schedule of reinforcement in which each correct response is followed by a positive reinforcer.

Desensitization - The lessening or disappearance of a (often negative) response through the pairing of a positive reinforcer with the negative stimulus or event; the process of turning an unfamiliar object or even (usually negative) toward a neutral position. The object or event can then be paired with positive reinforcers to make it positive.

Differential Reinforcement (Selective Reinforcement) - The act of selectively reinforcing desirable responses to shape the topography of a specific behavior.

Differential Reinforcement of an Incompatible Behavior (DRI) - Schedule of reinforcement used to cause a decrease in the frequency of an undesirable behavior, in which the subject is reinforced for performing a behavior that competes, or is incompatible with, the undesirable behavior.

Differential Reinforcement of Other Behavior (DRO) - Schedule of reinforcement used to cause a decrease in the frequency of an undesirable behavior, in which the subject is reinforced for emitting any response other than the undesirable behavior.

Discrimination - The ability, acquired through learning, to make distinctions between stimuli that are similar but not exactly alike.

Discriminative Stimulus (SD, Cue, Signal) - A stimulus that precedes a behavior, signaling that a specific response will be reinforced if emitted correctly. The result is that the stimulus will consistently elicit that particular response.

Extinction - Process in which reinforcement is no longer delivered following a specific, previously reinforced behavior. Eventually the behavior dies out due to lack of reinforcement.

Extinction Burst - The increase in a response that is brought on by the withdrawal of reinforcement. Extinction burst occurs just prior to the decline of behavior (due to lack of reinforcement) prior to extinction.

Fixed Interval Schedule of Reinforcement - A schedule of reinforcement in which reinforcement follows the first correct response after a set amount of time has elapsed since previous reinforcement. For example, if an animal is reinforced every 5 seconds (assuming the last response is correct), this is expressed as FI=5 seconds.

Fixed Ratio Schedule of Reinforcement - A schedule of reinforcement in which reinforcement follows the correct response after a set number of responses. For example, if every second response is reinforced, FR=2.

Generalization - The lack of discrimination between two stimuli. An animal that has been conditioned to respond to a specific stimulus may offer the same response in the presence of a similar stimulus.

Incompatible Behavior - A behavior that is impossible to perform at the same time as another specific behavior. For example, running and sitting are incompatible behaviors as one subject cannot perform both simultaneously.

Jackpot (Bonus) - A positive reinforcer that is much larger than usual, and usually unexpected.

Latency - The interval of time between the stimulus and response.

Least Reinforcing Stimulus - All interaction or reinforcement, positive or negative, is withheld for a period of time (generally a few seconds) so as not to reinforce an undesirable behavior or lack of response. Also termed Least Reinforcing Scenario, serving as a stimulus for calm behavior following an incorrect response, which helps to reduce frustration that might result from a lack of reinforcement.

Magnitude of Reinforcement - Refers to the size, strength, and duration of the reinforcement following a behavior.

Negative Reinforcement - An act that increases the frequency of a behavior by providing an aversive stimulus that the subject will work to avoid or escape. The frequency of behavior increases to avoid the onset of or to terminate the aversive stimulus.

Observational Learning - A type of learning in which the behavior of another organism is observed and imitated.

Operant Conditioning - A type of learning in which behavior is determined by its consequences. A positive consequence to a behavior will increase the likelihood of the behavior being repeated, while an undesirable consequence will decrease the likelihood of the behavior being repeated.

Positive Reinforcement - The addition of a desirable stimulus to the subject's environment following a behavior, thereby causing an increase in the frequency occurrence of that behavior.

Primary Reinforcer (Unconditioned Reinforcer) - A reinforcing event that does not depend on learning or previous experience to achieve its reinforcing properties (e.g. biological need: food, water, sex).

Punishment - An act that occurs immediately after an undesirable behavior it is meant to affect, and causes a decrease in the frequency of that behavior. This can be in the form of the administration of an aversive stimulus or event (positive punishment) or the removal of a desirable stimulus (negative punishment); Anything that decrease the frequency of the behavior it immediately follows.

Regression - The state of a previously conditioned behavior reverting back to a lower stage of development.

Reinforcer - An object, action or other stimulus that causes an increase in the frequency of a response.

Schedule of Reinforcement - The conditions under which reinforcement is delivered.

Secondary Reinforcer (Conditioned Reinforcer) - An object or event that initially may mean nothing to the animal but becomes reinforcing through pairing with a primary reinforcer.

Shaping - An operant conditioning method of taking an action or tendency and shifting it, one approximation at a time (successive approximations), toward the final behavioral goal.

Stimulus Control - Once a behavior is shaped, it is said to be under stimulus control if it meets 3 conditions: 1) It is immediately offered following the SD; 2) It is offered only when preceded by the correct SD; and 3) It is not offered in the presence of another SD.

Superstitious Behavior - A behavior that is unrelated to the behavior being conditioned but has inadvertently been reinforced often enough that it becomes fixed in the subject's mind as necessary in order to receive reinforcement.

Target (noun) - A prop (or position) which pinpoints a specific location for an animal to touch, approach, or follow, e.g. a trainer teaches a rhino to touch its lip to the end of a stick. Using this technique to shape a behavior is called "targeting."

Target (verb) - The act of an animal touching a predetermined target, (spot, point or prop) with a designated body part.

Time-out - Type of mild punishment (negative punishment) in which the trainer ceases to provide stimuli or responses (i.e. removes him/herself and all other opportunity for reinforcement from the training situation) for a period of time with the intent of causing a decrease in the frequency or occurrence of an undesired response.

Unconditioned Stimulus (US) - In Classical Conditioning, any stimulus that can elicit reactions from a subject with out prior conditioning.

Variable Reinforcement - A reinforcement schedule in which the timing and/or delivery of the reinforcer is unpredictable.

Variable Interval Schedule of Reinforcement - A reinforcement schedule in which the reinforcement follows the first correct response after a varying amount of time. For example, if an animal is reinforced on the average of every 4 seconds, we say that VI=4.

Variable Ratio Schedule of Reinforcement - A reinforcement schedule in which reinforcement follows the first correct response after a varying number of responses. This is usually based on an average number of responses to be reinforced. For example, if the average is to be every third response (VR=3), the number of behaviors requested would average 3 between each reinforcement.