About my Blog

This blog will help the people, who are interested in Learning Teradata basics in deep.. And it will be helpful for Certification and interview perspective also..

By Santhosh.B

Tuesday, 30 October 2012

Teradata Architecture


Basically Teradata architecture have three components..
those are:
1) PE(Parser Engine)
2)BYNET
3) AMP(Access Module Processor).


PE-parser Engine
  • Heart of the Teradata.
  • The Parsing Engines are perfectly balanced, with each having the capability to handle up to 120 users at a time.
  • This could be 120 distinct users or a single user utilizing the power of all 120 sessions for a single application.
  • That is why there are multiple PE’s in every Teradata system.
  • Each PE has total command over every AMP.
  • Each PE will take users SQL and do three things:
1.Syntax check - check the users SQL syntax.
2.Security Check - check the users ACCESS RIGHTS.
3.Plan - PLAN to satisfy the user request.
  • The fastest plan is a Single-AMP retrieve.
  • The second fastest plan is a Two-AMP retrieve.
  • The next fastest plan will be all AMPs reading only a portion of the table, and The slowest plan is the full table scan. That is where each AMP reads every row they contain for a table.

AMP(Access Module Processor)

  • Each PE rules them all because the rows of every table are spread across all the AMPs.
  •  AMPs organize every table in separate blocks.
  • PE passes the PLAN to the AMPs over the BYNET.
  •  When a table is first created each AMP creates a table header on their disk.
  •  When the table is loaded each AMP receives rows for that table that they and only they own.
  • They carefully place the rows inside data blocks where they can easily be retrieved.

BYNET

  • The PE comes up with a PLAN and passes the plan to the AMPs in steps over the BYNET.
  • AMPs then retrieve the data requested by the PE and they deliver their portion of the answer set to the PE over the BYNET.
  • BYNET provides the communications between AMPs and Pes.
  • There are always two BYNETs for redundancy and extra bandwidth. AMPs and PEs can use both BYNETs to send and retrieve data simultaneously.


Database and Logical Modeling?


 Database:
                A database is collection of permanently stored data used by an application or enterprise. 
A database contains logically related data, which means that the database was created purpose of mind.  
A database supports shared access by many users.
·Protected access to data is controlled.
·Managed data has integrity and value.
·Based on the relational model.
Logical Modeling:
Tables are logically created for all database systems.
The logical model should be independent of usage. A variety of front end tools can be accommodated simultaneously so that the database can be created more quickly. Teradata supports normalized logical models, because we are able to perform 64 table joins and we are to perform large aggregations during queries.
A key Teradata strength is our ability to model the business of customers. Teradata Business models are truly normalized avoiding the costly star schema, snowflake. Teradata can do star schema and other types of relational modeling, but 3NF is recommended.

Teradata Advantages

·Automatic, Even data distribution.
·High scalability.
·Mature optimizer (complex queries, 64 joins, ad-hoc processing).
·Model the business: 3NF, Star schema..etc.
·Lowest TCO (Total Cost of Ownership):  Easy to install, Easy to work, Easy to manage and robust utilities.
·Acts like Single DataStore.

  • Many bulk load utilities: BTEQ, FASTLOAD, MULTILOAD, TPUMP, FAST EXPORT…

why Teradata?

·When compare other RDBMS it is good in performance wise, because it is shared nothing architecture.
·It can store billions of rows.
·Unconditional parallelism.
·Using Indexes for better storing and fast retrieval.
·Supports easy scalability from small (10GB) to a massive database (100+TB).
·System to grow to support more users/data/queries/complexity of queries without experiencing performance degradation.
·Provides a parallel aware optimizer that makes query tuning unnecessary to get a query to run.
·And Optimizer determines the least expensive plan (time-wise) to process queries fast and in parallel.
·Automatic and even distribution avoiding complex indexing schemes or time consuming
reorganizations.Single operational view of the entire MPP (massively parallel processing) system and single point of control for the DBA (TD manager).

Monday, 29 October 2012

What is Teradata(TD)?

·TD is a RDBMS in DW Environment.
·TD is an open system means that it is platform independent and satisfies industry standards.
·Compatible with industry ANSI standards.
·And it is currently available for the UNIX and WINDOWS operating system.
In this, for Teradata 13.0 version will support only windows7 32bit only. If you want to install in windows 7 64bit you should use VMvare.
And TD13.0 version is not supporting for windows XP. TD 12.0 version will install in XP o.s.

Check bellow table once.

Windows 7 64bit
Windows 7 32bit
Windows XP
TD 13.0
Use  VMvare
It will support
Use VMvare
TD 12.0
VMvare
VMvare
It will support
·It will run on single or multiple nodes or severs.
·And it can act’s like a server.
·Built in parallelism (for this I will give you deep explanation in coming posts).
·Client platforms access the database through TCP-IP connection or across an IBM mainframe channel connection.
·Large database server.
·The Teradata Database was the first commercial database system to support a trillion bytes of data. 10^12= 1,000,000,000,000 (Trillion) bytes.
·Built on a parallel architecture.